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In  order  to  estimate  the  parameters  of  predictive  models  of  an 
economic  time  series  it  is  necessary  to  specify  the  form  of  the 
stochastic  process  which  generates  the  observed  values  of  the 
series.  This  paper  suggests  using  information  on  contiguous 
changes  in  the  observed  series  in  such  structural  specification. 
A  variable  summarizing  such  information  is  defined,  and  the 
behavior  of  this  'variable  is  examined  for  various  stochastic 
processes  which  may  generate  an  observed  seri.es.  This  variable 
is  defined  so  as  to  enable  information  on  the  relationship 
between  contiguous  changes  to  be  obtained  in  terms  of  this 
variable  either  from  samples  of  observed  realizations  or  from 
subjective  data  obtained  from  decision-makers. 


1 .   Int  r oduct  ion 

Most  if  not  all  economic  variables  depend  on  other  variables .  Even 
at  infinite  cost^  however,  such  dependence  cannot  be  completely  specified  --  the 
result  of  the  economic  analog  of  the  Heisenberg  uncertainty  principle  that  It  is 
impossible  to  observe  all  elements  of  a  situation  without  the  fact  of  observation 
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affecting  at  least  some  of  the  observed  elements.  Consequently  such  dependence 
can  only  be  perceived  in  gross,  with  detail  masked  by  a  mist  of  minor  unspecified 
influences.  Taking  the  cost  of  specification  into  account,  the  mist  quickly 
tends  to  become  a  dense  fog  obscuring  even  major  relationships  among  variables. 
"The"  econometric  problem  is  to  discern  such  major  relationships  through  the 
cloud  of  other  influences  which  affect  a  body  of  data. 

In  this  paper  I  shall  be  concerned  with  a  subtroblem  within  "the" 
econometric  problem:  namely,  with  the  specification  of  the  time-dimensional 
structure  of  relationships  among  variables.  In  the  past,  it  has  commonly  been 
assumed  that  this  structure  can  be  adequately  described  by  relatively  simple 
lag  structures  between  variables  or  first  differences  of  variables.  I  believe 
that  in  many  circumstances  use  of  such  simple  models  of  time-dimensional  struc- 
ture can  lead  to  substantial  mis -specif icat ion  of  relationships,  and  therefore 
propose  to  examine  the  time -structural  aspect  of  the  general  specification 
problem  more  closely. 

The  general  specification  problem  really  involves  two  specification 
problems;   (l)  specifying  a  conceptual  model  of  the  relationships  between  a 
variable  and  other  variables  which  influence  it,  and  (2)  specifying  a  model  of 
the  correspondences  between  the  variables  specified  in  the  conceptual  model  and 
variates  observable  in  the  real  world.  Both  problems  are  involved  in  the  sub- 
problem  of  specifying  time-dimensional  structure,,  which  can  be  defined  more 
specifically  as  the  problem  of  specifying  the  time-pattern  of  changes  in  a 
variable  which  are  generated  by  a  given  change  in  another  variable.  It  is 
necessary  both  to  specify  a  conceptual  model  describing  the  time-pattern  of 
this  dependence  and  to  specify  the  empirical  meaning  of  the  changes  in  terms  of 


which  the  structural  model  is  defined.  Since  changes  generated  by  an  identifi- 
able change  in  another  variable  cannot  be  observed  apart  from  changes  generated 
by  other  changes  in  that  variable  or  by  changes  in  other  variables,  the  second 
part  of  the  general  specification  problem  is  as  critical  as  the  first.  In  order 
to  identify  the  time-dimensional  structure  of  a  dependence  of  one  variable  on 
another,  it  is  necessary  first  to  specify  the  time  pattern  of  the  relationship 
between  the  dependent  variable  and  independent  variables  not  included  in  the 
model  to  be  specified. 

In  this  paper,  I  shall  be  concerned  exclusively  with  this  "prerequisite" 
specification,  which  may  be  termed  the  specification  of  the  time-dimensional  struc- 
ture of  unspecified  relationships  between  a  variable  and  other  variables.  I  shall 
discuss  the  nature  of  this  specification  in  more  detail  in  the  next  section,  and 
will  suggest  a  new  method  of  analysis  which  can  be  used  both  to  supplement  current 
analyses  of  observed  values  of  a  time  series  and  to  incorporate  subjective  data. 
I  shall  then  go  on  in  further  sections  to  develop  this  method  of  analysis. 

The  pragmatic  justification  for  developing  such  analytic  techniques 
need  hardly  be  discussed.  The  importance  of  specifying  the  time -dimensional 
structure  of  economic  relationships  is  becoming  increasingly  apparent  for  numer- 
ous economic  models  which  attempt  to  describe  the  effects  of  actions  taken  by 
decision  units  in  the  economy.  It  is  not  difficult  to  enumerate  a  by  no  means 
exhaustive  list  of  examples  of  problems  in  which  such  specif ication  is  important: 
explaining  the  behavior  of  aggregate  consumption,  analyzing  the  nature  of  reac- 
tions to  changes  in  the  price  of  a  particular  commodity,  describing  the  dividend 
policies  of  corporation  managers,  deciding  upon  the  storage  capacity  of  a  dam 
to  be  built  for  flood-control  purposes,  forecasting  changes  in  a  manufacturer's 
sales  or  inventory,  and  so  forth.  In  all  these  applications,  it  is  essential  to 


specify,  insofar  as  is  possible,  the  time-dimensional  character  of  the  variables 
which  enter  each  model  as  inputs  to  the  decision  process,  quite  apart  from  the 
problem  of  specifying  the  nature  of  the  relationships  between  those  variables 
and  the  decision  variable. 
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tzl Analyzing  the  time  structure  of  "independent"  variables 


In  order  to  deal  with  the  simplest  case,  I  shall  define  the 
"prerequisite"  time -structure  specification  problem  in  terms  of  a  variable 
which  is  not  clearly  dependent  on  any  other  variable,  so  that  changes  in  t 
variable  are  determined  by  influences  which  are  unidentified  and  hence  unpre- 
dictable.  In  the  face  of  the  unpredictability  of  these  influences,  the  "safe 
specification  that  can  be  made  -  safest  in  terms  of  minimax  criteria  -  is  t 
postulate  that  changes  in  the  sum  of  their  effects  are  realizations  of  a  pure 
random  process.   There  may  well  be  error  in  such  specification,  of  course. 
But  even  were  it  possible  to  know  with  certainty  the  magnitude  of  such  er3 
in  any  given  period,  the  most  accurate  specification  that  can  be  made  in  the 
absence  of  information  about  the  direction  of  that  error  still  would  be  merely 
to  flip  a  fair  coin  to  "predict"  in  which  direction  the  change  of  known  magn 


tude  will  be.1 


If  some  of  the  influences  upon  a  variable  are  identifiable,  then 
clearly  it  would  be  erroneous  to  postulate. that  changes  in  the  sum  of  the 
of  all  influences  were  random,  since  it  would  then  b. 
portion  of  those  changes  given  prior  info:  anges  in  the  identii 

iuences.   It  may  be  possible,  for  instance,  to  regard  e 
orders  for  a  company's  product  as  partially  determined  by  previous  chan 


On  thlt  Z„tS  ,P  It  t^  thS  SaVaSS  aXi°mS  3re  Valid  for  normative  purposes 
«m!  t  /°  '  the  d}scus^on  of  the  Knightian  distinction  between  measur- 
Raiffat  TiUnmea+Surable"  <sic?)  uncertainty  in  Ellsberg  [21],  along  with  Soward 
Raiffa's  perceptive  comments  in  [56]. 
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or  in  the  company's  expenditures  on.   advertising.  If  so,  it  should  then  prove 
possible  to  improve  predictions  of  changes  in  the  new  order  rate  over  those 
obtained  by  simply  assuming  all  future  changes  in  influences  on  orders  to  be 
realizations  of  a  random  process  and  on  that  basis  extrapolating  the  stochastic 
process  of  which  new  orders  are  specified  to  be  realizations.  To  do  so,  it  will 
be  necessary  to  specify  the  time-dimensional  structure  of  the  relationship  between 
the  variable  and  the  identified  influence  as  well  as  of  the  relationship  between 
the  variable  and  unidentified  influences.  I  shall  not  discuss  such  simultaneous 
specification  and  estimation  in  this  paper,  but  will  instead  restrict  the  analysis 
to  specification  of  the  time-dimensional  structure  of  the  relationship  between  a 
variable  and  influences  which  are  assumed  to  be  all  unidentified.   I  shall 
thus  restrict  myself  to  the  specification  aspects  of  the  classic  problem  of  time 
series  analysis. 

Since  any  lime  series  is  but  a  sample  realization  of  some  stochastic 
process,  the  essence  of  the  problem  of  either  predicting  or  describing  such  a 
series  is  to  describe  the  underlying  stochastic  process.  Since  any  stochastic 
process  can  in  turn  be  defined  as  a  transformation  of  a  purely  random  process, 
time  series  analysis  may  thus  be  defined  as,  in  essence,  the  estimation  of  the 

nature  of  this  transformation  from  a  set  of  observations  on  the  economic  time 

i 

Where  the  time  pattern  of  effects  of  an  identified  influence  cannot  be  specx- 

fied  apart  from  that  of  unidentified  influences,  information  about  forthcoming 
events  can  still  be  incorporated  into  a  forecast  by  simply  equating  the  expected 
value  of  a  future  "random"  shock  to  the  expected  total  effect  of  the  identified 
influence  then  current  and  assuming  the  time  pattern  of  effects  of  that  influ- 
ence to  be  the  same  as  that  of  unidentified  influences.  By  specifying  a  model 
of  a  time  series'  structure,  it  is  thus  possible  for  a  manager  to  incorporate 
information  about  future  shocks  explicitly  into  that  model's  forecasts,  rather 
than  using  that  information  as  the  basis  either  for  "fudging"  the  model's  pre- 
diction or  for  avoiding  the  model  entirely. 
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l 
series.   Definition  of  this  transformation  provides  a  formal  definition  of  the 

structure  of  the  relationship  between  a  variable  and  the  sum  of  all  influences 
upon  it  where  those  influences  are  unidentified. 

The  foundation  of  classic  time-series  analysis  may  thus  be  viewed  as 
the  postulated  existence  of  a  purely  random  process  defining  a  sequence  of  random 
shocks  j£(j);  j£T}  which  are  independently,  identically,  and  normally  distri- 
buted for  all  values  of  the  index  set  T,  with  zero  expected  value  and  unit  var- 
iance. This  set  of  random  shocks  by  definition  is  a  strictly  stationary  stochas- 
tic process,  corresponding  to  what  (in  the  continuous  case)  a  communications 
engineer  would  term  "pure  white  noise."  Defining,  then  a  transformation  f 
(not  necessarily  linear)  such  that 

(2.D  {X(t)j  t£T>        =       r^Q)i  JE.f,  J-t} 

it  is  necessary  to  be  able  to  estimate  ^from  the  observed  realization  of  the 
stochastic  process  X(t).  If  this  transformation  is  linear  and  X(t)  is  stationary, 
then  spectral  analysis  provides  a  relatively  powerful  means  of  estimation.2 
(Satisfactory  estimation  techniques  have  not  yet  been  developed  to  estimate  non- 
linear transformations . ) 

Time  series  analysis  may  be  defined  in  this  manner.  Any  stochastic  process 
can  be  written  as  a  transformation  of  a  purely  random  process.  To  state  that 
this  definition  is  useful  is  another  matter,  however,  involving  as  it  must 
some  statement  about  the  inverse  readability  of  the  process  as  so  defined. 
It  can  be  conjectured  that  it  is  useful  to  regard  the  prediction  problem  as 
that  of  estimating  the  nature  of  the  transformation  defined  in  equation  (2.1). 
Norbert  Wiener  has  taken  essentially  this  approach  ir  [70].  As  Davis  [l7j[l8J  has 
demonstrated,  it  can  be  shown  that  optimal  linear  prediction  can  be  regarded,  as 
conjectured,  as  a  measurement  of  state  variables  in  a  white  noise  excised  model 
and  of  their  initial  condition  decay.  Extension  of  this  result  to  non-linear 
systems  is  not  facile.  However,  since  all  models  dealt  with  in  this  paper  are 
linear  m  the  purely  random  process,  Davis'  result  is  valid  for  these  models. 
2 

See  for  instance  Grenander  and  Rosenblatt  [29],  Parzen  [<fr,    chapter  5], 
Rosenblatt  [60,  chapter  7],  and  Whittle  [69].  Stigum  [65'1  generalizes  this 
means  of  estimation  to  include  certain  classes  of  non- stationary  processes.  For 
a  brief  exposition  of  the  interrelationship  between  the  statistical  analysis  of 
^stationary)  time  series  and  the  theory  of  stochastic  processes  possessing  finite 
second  moments,  see  Parzen  [55]  or  Bartlett  [5]. 
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However,  as  Bernt  Stigum  has  shown  in  an  important  unpublished  paper 
[65 h  it  is  not  generally  possible  to  specify  a  unique  transformation  from  the 
spectrum  derived  from  a  given  set  of  time  series  observations.  Bayesian  estima- 
tion is  consequently  necessary  in  the  analysis  of  time  series;  that  is,  even 
given  that  such  a  transformation  would  be  linear,  it  is  necessary  to  specify  the 
form  of  the  relevant  transformation  on  the  basis  of  economic  theory  and  other 
"prior"  knowledge.   The  immediate  purpose  of  this  paper  is  twofold:   (l)  to 
classify  the  stochastic  processes  implicitly  specified  by  various  economic  models 
in  terms  of  narrower  specifications  of  the  t  -transformation  defined  in  (2.1), 
and  (2)  to  suggest  a  means  of  utilizing  not  only  objective  sample  data  in  speci- 
fying t  but  also  expectational  and  interview  data  on  decision-makers '  own 
evaluations  of  the  nature  of  the  stochastic  processes  underlying  time  series  in 
which  they  are  interested.  I  shall  not  concern  myself  with  the  problem  of  esti- 
mating r  once  the  nature  of  j    is  known,  but  shall  confine  myself  purely  to  the 
specification  problem.  Though  both  goals  stated  above  are  motivated  by  the  need 
to  develop  better  means  of  specifying  the  time  structure  of  economic  variables, 
the  second  should  as  a  by-product  also  provide  the  basis  for  analyzing  hypotheses 
about  decision-makers*  anticipations. 

A  substantial  body  of  information  on  economic  time  series  is  available 
only  in  terms  of  the  relationship  between  successive  changes  in  an  observed  time 
series.  Much  expectational  information,  for  instance,  is  perceived  in  this  form. 

This  position  is  of  course  in  conflict  with  the  methodological  structure  of 
Milton  Friedman  [2J],  as  indeed  is  any  Bayesian  position.  Perhaps  the  most 
useful  function  of  the  pathbreaking  papers  of  Yule  [7^]  and  Slutzky  [63]  on 
the  causation  of  so-called  "cycles"  is  to  warn  of  the  great  danger  involved 
in  relying  upon  models  of  economic  time  serif ->  which  are  not  based  on  pre- 
viously-derived "realistic"  models  of  such  series'  structure. 
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While  such  data  is  not  sufficiently  precise  to  be  worth  subjecting  to  very  heavy 
analysis,  it  may  contain  information  which  can  shed  light  on  the  structure  of  the 
time  series.  It  should  therefore  be  useful  to  develop  for  both  subject  and 
sample  data  a  means  of  evaluating  the  correspondence  between  relationships  between 
successive  changes  and  the  time-dimensional  structure  of  a  variable. 

I  suggest  that  perhaps  the  most  fruitful  way  of  examining  this  corres- 
pondence, is  to  define  a  variable  which  will  summarize  the  relationship  between 
successive  changes,  and  then  to  analyze,  for  different  time  series'  structures, 
the  relationship  between  this  variable  and  different  lengths  of  the  intervals 
over  which  changes  are  defined.  For  convenience,  I  propose  to  think  in  terms  of 
changes  centered  around  some  fixed  point  t.  Let  Ap  denote  change  in  X  over 
the  interval  between  t-R  and  t  for  an  arbitrarily-chosen  R ,  and  let  A,,  denote 
change  in  X  between  t  and  t+F  for  an  arbitrary  F.   (The  intervals  between  t-R 
and  t  and  between  t  and  t+F  may  be  termed  the  record  and  forecast  horizons, 
respectively).  Define  a  variable  7?R  such  that  a  decision-maker's  conditional 
expectation  for  Ap  given  A_  can  be  written  as 

(2'2)  "I^IV   =   7FB<V- 

I  propose  to  use  this  variable  to  describe  the  relationship  between  the  successive 
changes  denoted  by  Ap  and  AR.  Other  models  of  this  relationship  might  be  specified. 
My  ground  for  choosing  this  model  is  the  purely  pragmatic  justification  that  vir- 
tually all  quantitative  data  can  be  expressed  in  terms  of  this  model  and  that 
much  qualitative  data  can  be  expressed  only  in  terms  of  it.  By  establishing  the 
properties  of  7m   for  different  specifications  of  the  f  -transformation,  it  will 
thus  be  possible  to  use  qualitative  information  such  as  decision-makers'  subjective 
estimates  of  7^  for  different  F  and  R  which  are' formed  on  the  basis  of  their 
experience  and  their  familiarity  with  the  nature  of  the  influences  upon  the  variable 
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being  examined.  It  will  at  the  same  time  be  possible  to  estimate  the  relationship 

between  7   and  the  values  of  F  and  R  directly  from  historical  records  of 
FR 

observations  of  a  time  series .  Such  direct  estimation  is  used  in  applications 
discussed  in  Section  6  below.  But  such  direct  estimation  will  be  of  limited  value 
where  the  relevant  historical  record  is  small.   It  should  prove  useful  in  many 

applications  to  be  able  to  incorporate  subjective  information  obtained  in  a  form 

2 

consistent  with  that  of  information  derived  from  sample  data. 

Both  E  [Ap,  j  ApJ  and  Ap  may  be  defined  quite  broadly.  The  values  which 
they  may  take  on  may  be  the  set  of  all  real  numbers;   they  may  comprise  only  a 
selected  set  of  integers;  or  they  may  simply  be  the  trichotomy  consisting  of 
"up,"  "down,"  and  "no  change."  In  this  paper  I  shall  assume  both  E  [A.,  Aj  and 
A^  to  be  real  numbers;   7   is  thus  real-valued.  If  by  contrast  observations  of 
E  [Aj,  A,]  and  Ap.  are  limited  to  statements  of  direction  of  change  rather  than 

The  historical  record  may  be  small  either  because  past  data  is  available  only 

in  very  limited  quantity  or  because  portions  of  the  historical  data  are  known 

to  be  influenced  by  special  factors  which  would  introduce  estimation  bias  if 

such  data  were  incorporated  into  the  sample  from  which  7   is  estimated.  The 

notion  of  a  "relevant" historical  record  is  a  vague  concept:  what  is  meant 

is  a  historical  record  over  a  period  in  which  the  parameters  of  a  variable's 

time  structure  can  be  assumed  (l)  to  be  stable,  and  (2)  to  accord  with  those 

of  the  variable  in  subsequent  periods  for  which  structural  estimates  are  to  be 

used  for  forecasting  purposes.  A  "relevant" record  so  defined  should  exclude 

observations  known  to  contain  misleading  information;   on  this,  see  Frank 

Fisher's  pungent  remarks  in  [23,  chapter  l]. 

2 

Since  both  subjective  and  historical  information  are  then  summarized  in  the 

same  set  of  parameters  (7  ),  an  estimate  taking  both  sets  of  information 

'  FR 
into  account  may  be  formed  by  simply  combining  the  two  sets  of  parameters  esti- 
mates in  a  weighted  average.  Using  S,  H,  and  P  as  superscripts  respectively  to 
denote  subjective,  historical,  and  pooled  estimates  of  7„0,  an  analyst  could  set 

P        S    H      s   H 
7FR  =  W  rFR  +  (1"W)  7FR, 

where  the  weights  w  and  (l-w)  reflect  the  analyst's  evaluation  of  the  relative 

information  content  of  7  ^  and  7^.      (This  procedure  is  analagous  to  the  combin- 

FR       r  R  r  1 

ation  of  prior  and  posterior  probabilities  discussed  in  e.g.  L57,  chapter  12Jand 

[6l];  it  is  not  the  same  thing,  however,  since  the  subjective  estimates  being 

combined  with  sample  estimates  are  those  of  individuals  other  than  the  analyst 

and  are  themselves  derived  by  interview  or  other  sample  survey  techniques . ) 
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real  numbers,  as  in  certain  anticipations  surveys  such  as  those  conducted  by 
Dun  and  Bradstreet  or  the  IFO-Institut  fur  Wirtschaftsforschung,  the  analysis 
developed  in  this  paper  may  be  applied  to  such  observations  by  equating  each 
element  of  the  trichotomy  of  directions  with  three  intervals  defined  on  the  range 
of  real  numbers. 

In  order  to  analyze  the  behavior  of  7_  for  different  stochastic  pro- 
cesses,  I  shall  define  it  more  narrowly  as  the  expected  value  (measured  over  an 
infinite  ensemble  of  realizations  of  the  process)  of  the  slope  of  the  regression 
of  Aj,  on  Ap  constrained  to  pass  through  (0, 0),  namely,  as  the  expected  value  of 

fn   o\  -*»      ^-  J?   Tjj 

This  definition  is  easily  applicable  to  observed  data. 

Empirically  derived  data  on  7__  will  of  course  be  subject  to  error. 
Such  errors  can  be  explicitly  introduced  for  estimates  of  7  „  obtained  from 
objective  data  by  stating  equation  (2.2)  in  the  following  form: 

<2'4>  *p   "   ?FR  IV   *   ^fr 

where  u_,  the  difference  between  A,  and  E  [A„  )  A_],  includes  the  difference  between 
the  true  values  of  A„  and  the  conditional  expectation  as  well  as  errors  of  observa- 
tion in  each.  If  u^  satisfies  the  standard  Markov  theorem  assumptions  of 

The  two  numbers  which  demarcate  the  three  intervals  (or  subsets  of  the  set  of  all 
real  numbers)  corresponding  to  "up,"  "down,"  and  "no  change"  may  be  thought  of  as 
perception  thresholds.  An  illuminating  discussion  of  such  thresholds  in  terms  of 
response  functions  is  contained  in  Theil  [67;  pp.  196-199] •  While  some  specifi- 
cation error  is  introduced  by  considering  thresholds  to  be  fixed  points  rather 
than  continuous  functions  a  la  Theil,  such  error  should  in  most  cases  be  of  minor 
importance . 
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independence,  zero  expected  value,  and  homoscedasticity,  then  7.—,  is  an  unbiassed 

and  by  minimum- variance  criteria  a  "best"  estimate  of  7™,.  It  should  be  noted 

m 

that  the  relationship  between  7   and  values  of  F  and  R,  which  will  be  denoted 
by  the  function  7  (F.R),  is  intimately  related  to  the  form  of  the  autocorrelation 
function  R(L)  defined  over  lags  of  L  =  0,1,2,....  Specifically,  if  u^  satisfies 
the  Markov  theorem  assumptions  then 
EA^  =  cov(zy^),  EA^  =  var(^),  and 

,p  ..  ~  [RF  +  RR]  -  [R0  +  W 

(2'5)  7FR  =  E  7FR  =  2[Rn  -  Rj 

Consequently,  where  it  can  be  assumed  that  u_  satisfies  the  least-squares  assumptions, 
I  will,  in  deriving  the  properties  of  7(F,  R)  for  various  stochastic  processes,  really 
be  indirectly  deriving  the  nature  of  the  autocorrelation  function  for  those  pro- 
cesses. In  such  cases,  if  the  autocorrelation  function  for  a  process  has  previously 
been  derived,  I  shall  derive  the  properties  of  7(F,  R)  in  terms  of  that  function. 

I  have  justified  estimation  of  7   primarily  in  terms  of  the  usefulness 
of  developing  analytic  techniques  that  can  be  applied  to  subjective  as  well  as 
numerical  data.  However,  it  should  not  be  assumed,  where  only  historical  data 
are  available,  that  estimation  of  7(F,R)  is  redundant  once  the  autocorrelation 
function  has  been  estimated  in  cases  where  vl,  satisfies  the  Markov  assumptions. 
Though  both  R(L)  and  7(F, R)  summarize  the  same  information  in  such  cases  --  both 

It  should  be  noted  that  the  serial  coefficient  estimates  of  the  R^  implicitly 
specified  by  (2.5)  as  defining  the  estimates  of  7   for  a  given  ^observation 
sample  are  not  identical  to  the  estimates  of  the  same  R  which  will  typically 
be  obtained  from  that  sample  since  the  estimates  specified  in  (2.5)  are  defined 
over  a  subsample  including  but  N-F-R  observations  compared  with  the  N-L  obser- 
vations on  which  R^  is  typically  estimated. 
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functions  are  estimated  so  as  to  filter  out  noise  from  the  set  of  serial  coeffi- 
cient estimates  of  IL.  obtained  from  a  given  sample  --  each  summarizes  that  inform- 
ation in  a  different  manner  and  so  contains  different  information  about  the  inform- 
ation being  summarized.  Because  of  their  interdependence,  simultaneous  estimation 
of  the  two  functions  should  consequently  lead  to  better  estimation  of  both. 
I  shall  not  deal  with  possible  sources  of  bias  in  estimates  of  7 
obtained  from  either  subjective  or  objective  sample  data,  even  though,  in  the 
absence  of  proof  of  the  non-existence  of  such  error,  estimates  of  the  potential 
likelihood  of  such  error  must  be  made  in  order  for  7(F, R)  to  be  a  useful  diagnostic 
tool.   Where  7   is  derived  from  observation  of  decision-makers'  qualitative 
appraisals  of  E[ZV,  |  ApJ,  for  instance,  the  associated  errors  denoted  by  u^  clearly 
may  not  satisfy  the  least-squares  assumptions,  since  the  decision-makers'  condi- 
tional expectations  may  very  well  not  be  unbiased  --  or  in  any  way  be  "best 

2 
estimates."   Nevertheless,  I  shall  not  deal  with  this  problem  in  this  paper,  con- 
fining myself  to  an  evaluation  of  the  nature  of  the  function  y(¥, R)  which  is 
implied  by  various  specifications  of  the  1  -transformation,  defining  this  function 
as  the  relationship  between  F,  R,  and  the  expected  value  of  the  estimate  7^-,  defined 
in  equation  (2.3). 

Sampling  properties  of  estimates  obtained  from  objective  sample  data  will  be 
further  analyzed  in  [ll]  both  for  the  constrained  regression  defined  in  equation 
(2.2)  above  and  for  unconstrained  linear  regressions  of  A^  on  A,. 

2 

It  is  possible,  of  course,  to  define  unbiased  forecasts  as  "rational"  and  to 

assume  that  businessmen,  consumers,  and  other  decision-makers  in  the  economy  are 

rational  in  the  sense  of  this  definition.  This  is  not  to  say  that  this  procedure 

is  reasonable,  however.  Cf.  [^6],  [hQ],    and  the  comments  on  this  approach  in 

[10,  esp.  pp.  7-8]  and  [8]. 
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Nor  shall  I  be  concerned  with  analyzing  7(F,R)  in  terms  of  the  path 
traced  through  time  by  X(t),  even  though  this  problem  has  historically  been  of 
interest  in  the  analysis  of  business  fluctuations.  Since  even  a  random  walk  can 
generate  time  paths  that  are  sufficiently  "cyclical"  to  approximate  the  fluctua- 
tions present  in  most  economic  time  series,   the  construction  of  models  to  propa- 
gate such  "cycles"  is  not  a  particulary  interesting  activity. 

Having  indicated  what  I  will  not  do,  let  me  delineate  what  I  will  do. 
I  shall  proceed  to  analyze  the  relationship  between  7_,  defined  as  the  expected 

/\ 

value  of  the  estimate  7   defined  in  (2.3),  and  the  horizon  parameters  F  and  R  for 
different  discrete-process  models  of  X(t).  In  Section  3,  I  shall  do  this  for  a 
simple  stochastic  process  which  has  recently  seen  fairly  widepsread  use  in  economic 
models.  In  Section  k,    I  shall  then  use  the  model  defined  in  Section  3  as  the 
second  stage  of  a  two-stage  model  of  X(t),  defining  the  first  stage  of  this  model 
as  an  intermediate  stochastic  process  which  is  not  necessarily  a  pure  random  pro- 
cess but  which  can  be  written  as  a  relatively  straightforward  transformation  of 
one  which  is.   (Where  the  intermediate  process  is_  purely  random  with  zero  mean  and 
constant  variance,  the  model  of  Section  k   reduces  to  that  of  Section  3).  In 
Section  5,  I  shall  turn  to  alternate  ways  of  specifying  the  form  of  a  second  stage 
model  which  is  linear  in  the  elements  of  the  intermediate  stochastic  process,  and 
shall  analyze  the  behavior  of  7__  in  these  models. 

See  for  instance  Feller  [22,  chapter  3]  and  Kerrich  [36].  An  earlier  example  is 
provided  by  Working's  193^  paper  [73].  The  conception  of  the  effectiveness  of 
random  shocks  in  generating  "cycles"  is  due  to  Eugen  Slutzky  [63],  G.U.  Yule 
[7^],  and  Ragnar  Frisch  [28],  The  papers  of  Slutzky  and  Yule,  both  published  in 
1927,  showed  that  a  summation  of  random  shocks  produces  a  series  which  can  be 
well  described  ex  post  facto  by  a  Fourier  equation  (i.e.,  as  a  summation  of  cycles 
of  varying  frequency)  even  though  this  equation  does  not  predict  change  in  the 
observed  time  series. 
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An  illustrative  application  of  the  analyses  developed  in  Sections  3  through  5 
will  be  presented  in  Section  6. 

The  models  developed  in  Sections  3  through  5  are  all  linear  in  the  under- 
lying pure  random  process,  and  are  hence  special  cases  of  the  Yule-Slutzky  processes 
described  e.g.  by  Wold  [72].  I  suspect,  however,  that  they  are  adequate  to  describe 
most  economic  time  series.  As  will  be  seen,  the  stochastic  processes  analyzed  in 
Sections  3  through  5  are  not  all  stationary  -  some,  indeed,  not  even  in  thw  weak 
sense  of  covariance  stationarity.  While  this  does  not  make  estimation  easier,  it 
does  at  least  suggest  that  the  range  of  processes  described  below  may  be  sufficient 

to  include  most  time  series  of  interest  to  economists  and  managerial  scientists. 

1  ~~~ 

Like  the  optimality  of  T  -transform  estimation  postulated  supra,  this  claim  of 
broad  applicability  is  a  conjecture,  with  little  empirical  evidence  presently 
available  to  validate  it.  Mandelbrot  [U3]  has  suggested  that  many  economic  time 
series  are  realizations  of  a  quite  different  class  of  processes  (Pareto-Levy 
processes)  which,  in  contrast  to  the  processes  specified  below  in  Sections  3 
through  5,  do  not  satisfy  the  conditions  for  applicability  of  the  Central  Limit 
Theorem  since  finite  second  moments  do  not  exist  for  such  processes.  I  have  not 
attempted  to  analyze  r(F.R)  for  such  processes.  So  far,  little  empirical  evi- 
dence has  been  presented  to  indicate  whether  any  economic  time  series  is  real- 
ized from  a  Pareto-Levy  process,  or,  for  that  matter,  any  other  process.  Cf ., 
however,  Mandelbrot  [1*2 ][kk],   who  has  presented  some  evidence  indicating  that  both 
income  distributions  and  the  movement  of  commodity  prices  may  be  determined  by 
Pareto-Levy  processes. 

Another  class  of  processes  which  I  have  ignored  in  Sections  3  through  5  is  the 
set  of  processes  which  are  defined  as  the  sum  of  two  orthogonal  components:  a 
deterministic  mean-value  function  M(t),  defining  a  systematic  oscillation  and/ or 
polynomial  trend  over  t£T,  and  a  stochastic  process  X(t).  Specification  of 
such  processes  forms  the  basis  for  the  decomposition  approach  which  character- 
izes much  economic  time  series  analysis.  See  for  instance  the  orthogonal  poly- 
nomial approach  of  Hald  [30]  and  the  alternative  attempts  to  measure  M(t)  by 
multiple  smoothing  suggested  by  among  others  Maverick  [^5]  and  Brown  and  Meyer 
[133  •  Wold  [72,  Theorem  7]  has  shown  that  any  stationary  process  can  be  decom- 
posed into  M(t)  and  X(t)  and  that  moreover  X(t)  is  then  linear  in  the  pure  random 
process  r  (t).  Analyses  such  as  [30],  O5],  and  [13]  usually  assume  that 
Xfa)  =  ■  (t)'>   for  a  general  analysis  of  Wold-decomposable  processes  are  Rosenblatt 
I  59J°  I  suspect,  however,  that  most  if  not  all  economic  processes  are  character- 
ized by  mean-value  functions  identically  equal  to  zero.  If  this  is  so,  an  attempt 
to  measure  M(t)  rather  than  the  structure  of  X(t)  is  not  only  a  misorientation  of 
focus  but  also  is  likely  to  result  in  the  type  of  misspecification  against  which 
the  Yule  and  Slutzky  papers  warn.  The  application  to  common  stock  prices  in  [13] 
of  the  multiple-smoothing  formulae  advanced  by  Brown  and  Meyer  may  be  a  good 
example  of  the  dangers  of  such  an  approach  if  common  stock  prices  are  really  a 
random  walk  as  suggested  by  [l],  [35],  and  [58]. 
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3-     A  simple  stochastic  process 

The  stochastic  process  implicitly  specified  by  a  large  number  of  models 
advanced  in  recent  economic  literature  can  be  derived  from  the  following  special 
assumptions;   (l)  each  realization  of  the  process  is  determined  up  to  a  realiza- 
tion of  a  pure  random  process  by  the  effects  of  previous  realizations  of  that 
pure  random  process,  and  (2)  the  effect  of  any  realization  of  the  pure  random 
process  is  identical  in  all  periods  subsequent  to  that  realization  and  furthermore 
is  proportional  to  the  size  of  the  realization .  This  derivation  is  demonstrated 
in  section  3.1.  The  nature  of  the  relationship  between  \,   and  A^  in  time  series 
generated  by  such  a  stochastic  process  is  then  examined  in  section  3.2.  As  will 
be  shown  in  section  3.3,  the  stochastic  process  defined  in  section  3.1  can  be 
regarded  as  a  linear  combination  of  two  other,  more  specialized  stochastic  pro- 
cesses --a  random  walk  and  a  time  series  of  independently  and  identically  distri- 
buted variables  --  and  these  special  cases  will  be  briefly  discussed  in  sections 
3=3-  Some  implications  of  the  process  concerning  the  variance  of  X(t)  are 
analyzed  in  section  3 A, 

3.1  A  "proportional  effect"  model 

It  has  become  popular,  following  the  example  delineated  by  Milton  Friedman, 
to  regard  a  random  shock  as  having  two  components,  one  transient  and  the  other  par- 
sistent,  and  to  assume  that  the  proportibnate  importance  of  each  component  is  the 
same  for  all  random  shocks.   It  is  also  assumed  that  the  random  shocks  are  themselves 


Friedman's  most  well-known  model  is  that  presented  in  [26],  though  essentially  the 
same  model  was  described  in  an  earlier  work  in  which  Friedman  and  Simon  Kuznets  were 
co-authors  [27 ].  The  terminology  I  have  adopted  is  different  from  that  used  by 
Friedman,  who  denoted  the  two  components  into  which  random  shocks  could  be  parti- 
tioned as  "permanent"  and  "transitory. "  "Persistent"  probably  somewhat  more  fully 
connotes  the  necessary  existence  of  a  time  horizon  over  which  that  component  is 
defined . 
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realizations  of  a  pure  random  process  }^(t);  t£T>  with  mean  zero  and 

2 
variance  b  --or,  in  other  words,  that  they  are  realizations  of  a  process  which 

is  a  linear  homogeneous  transformation  of  the  process  {£(t)j  t£  t\    such  that 

(3.1.1)  I (%)    =     b[  £(t)]. 

From  the  assumption  of  equal  proportionate  importance  of  the  persistent  component 
for  each  shock,  each  shock  thus  has  an  identical  additive  effect  on  the  value  of 
X(t)  for  all  t^j.  This  effect  can  be  denoted  as  \[f  (j)],  0<  \  <  1,  where  \ 
represents  the  proportionate  size  of  the  persistent  component  of  each  random 
shock.  This  model  implicitly  assumes  that  the  observed  time  series,  X(t),  is  the 
sum  of  the  persistent  components  of  all  previous  random  shocks  together  with  both 
the  transient  and  the  persistent  components  of  the  random  shock  occurring  at  t. 
As  a  consequence 

(3-1.2)  x(t)  =  1(t)  +  \     £  %(j) 

j^t 

By  substituting  from  ( 3.1.1),  x(t)  can  be  defined  in  terms  of  the  unit-variance 
Gaussian  "white  noise"  process; 

(3.1.3)  X(t)  =  b  £  (t)  +  b  \  £   £  (j) 

j^t 

The  expected  value  of  X(t)  given  the  values  of  the  previous  random  shocks  --  the 
previously  determined  component  of  X(t),  in  other  words  --is  thus 

(3-1^)  E[x(t)  \  fc(j),  J-Ct]   =  bX   ^   £(j). 
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For  convenience,  I  shall  introduce  some  new  notation.  The  conditional  expectation 
E[x(t)  1  ^(j),  j^t]  will  be  denoted  by  the  symbol  X  ',      in  addition,  the  time 
index  will  be  subscripted  so  that  X(t)  will  be  denoted  by  X, .  Using  this  notation, 
it, is  evident  from  equation  (3,1.4)  that 

(3.1.5)  Xj   =   xt-1  +  b  x  ^ 

so  that 

(3-1-6'  h    -    Jt-i  ■  » »■  ^t-i 

Period-to-period  changes  in  the  previously  determined  component  of  a  time  series 
are  thus  defined  by  this  model  to  be  proportional  to  the  intervening  random  shock. 

This  model  is  equivalent  in  all  respects  to  the  exponentially-weighted 
moving  averages  which  have  been  specified  in  numerous  models  in  recent  economic 
literature  as  the  relevant  form  of  a  distributed  lag  structure.1  This  equivalence 
can  be  quickly  demonstrated  by  showing  from  (3.1.2)  and  the  definition  of  X.  that 
\  =  Xt  +  b  r  t  and  that  equation  (3. 1.6)  can  therefore  be  written 

(3.1.7)  ^  -  xt_±    =    x   [x^  -  xt_x] 

Changes  in  the  conditional  expected  value  of  J^  are  thus  defined  as  proportionate  to 
the  error  associated  with  the  immediately  previous  conditional  expectation.  Equation 
(3.1.7)  is  quickly  recognizable  as  the  definition  of  the  "adaptive"  expectations 

A  partial  list  of  recent  applications  of  exponentially-weighted  moving  averages 
includes  the  analyses  of  investment  behavior  by  Koyck  [37],  Eisner  [20],  and 
Kuh  [38] ;     the  demand  analyses  of  Nerlove  [50][5l];  the  analyses  of  aggregate  con- 
sumption by  Stone  and  Rowe  [66]  and  Friedman  [26];  and  the  analysis  of  monetary 
hyperinflation  by  Cagan  [lk],   A  slightly  different  formulation  of  this  model  was 
postulated  by  Lintner  [38]  in  explaining  the  relationship  between  dividends  and 
corporate  profits.  Exponentially-weighted  forecasts  have  been  advocated  as  both 
positive  and  normative  models  of  suppliers'  price  or  sales  anticipations  in  Arrow 
and  Nerlove  [2],  Brown  [12],  Forrester  [2k,    appendix  E],  Holt  [33][34],  Magee  [hi], 
Nerlove  [52] [53],  and  Winters  [71]. 
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which  have  been  postulated  by  Nerlove  and  Cagan.  Rearranging  terms  in  this 
equation, 

(3.1-9)  xt  =  x  xt_1  +  (i-\)  i       . 

This  equation  is  just  one  more  vay  of  stating  that  the  adaptive  expectations  model 
defines  "rational"  conditional  expectations  as  a  linear  combination  of  the  preceding 
value  of  a  precast  variable  and  the  conditional  expectation  of  that  value.  Iterating 
within  the  right-hand  side  of  (3.1.8),  it  is  apparent  that 

(3.1.9)  Xt  =  XtVi  +  jft-l  ^  ^V' 

The  conditional  expectation  denoted  by  X,  is  thus  an  exponentially  weighted  average 
of  all  previous  values  of  X. 

3.2  Analysis  of  y       for  the  "proportional  effect"  model 


The  relationships  between  successive  changes  in  X  for  changes  defined 
over  different  intervals  is  relatively  simple  to  analyze  for  the  "proportional 
effect"  model  in  terms  of  the  estimate  of  y       defined  in  (2.4).  First  of  all,  it 
should  be  evident  from  the  definition  of  the  "proportional-effect"  model  that,  since 
the  effect  of  a  random  shock  on  subsequent  values  of  a  time  series  if  assumed  to  be 
independent  of  elapsed  time,  7?R  is  unrelated  to  the  length  of  the  forecast  horizon.1 
This  can  be  seen  by  generalizing  equation  (3.1.5)  to  define  the  following  relation- 
ship between  X,   and  X ,  t 


The  fact  that  predictions  derived  from  an  exponentially-weighted  moving  average 
are  unrelated  to  the  forecast  horizon  over  which  the  predictions  are  made  has 
previously  been  stated  by  Muth  [47], 
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(3-2.1)  X+tV,     =  X,  +  b  X    Si       £ 


t+F-1 

Since  Xj  =  Xj  -  b  £  then,  denoting  (xt+f  -  Xt)  by  A^  equation  (3.2.1)  can  be 
rewritten  as 

t+F-1 


(3.2.2) 


^  -  Kt+F  +  ^s    fj  -bd-x)  £t. 


j=t+i 


Since  A^  the  change  in  X  over  some  interval  of  R  periods  prior  tb  t,  contains  no 

information  about  ^     for  j->t,  it  is  evident  that  E  [A  \a_]  is  unaffected  by 

>j  for  j^t.  Moreover,  since  ?Ap/'d£t  is  a  constant,  the  independence  of  the 

conditional  expectation  from  all  h,    jp>t,  then  implies  that  7_  is  a  constant  for 

•'J  FR 

any  given  value  of  R. 

A  corollary  of  this  statement  is  that  7_  is  a  function  of  X.     Moreover, 
since  Ap  is  itself  dependent  on  £  for  t-R<j<-t,  it  is  evident  from  the  same  line 
of  argument  that  7pR  is  also  a  function  of  R.  By  extension  of  (3.1. 5), 

(3.2.3)  ^  .  Ht+^J:Rtl  <j-Mi-^t.R 

Both  (3.2.2)  and  (3.2.3)  can  then  be  substituted  into  (2. If)  to  yield  the  following 
equation  defining  j % 


t+F-1  /        x     v 

(3.2-M  b^t+F    +bx    2    <J  "b(1->•,  Ct 

I  shall  specify  that  u^  satisfies  the  usual  least-squares  assumptions  in  order  to 
obtain  the  expected  value  of  the  estimate  of  7__  defined  in  (2.3).  Moreover,  since 
^  (t)  is  a  pure  random  process,  E  [  ^  £  ]  =  0  for  j  /  t.   Consequently,  multi- 
plying both  sides  of  (3.2.+)  by  the  term  in  square  brackets  of  that  equation  and 
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f orming  expected  values  of  the  resultant  product  moments,  we  obtain 

7ra=  -b2(l-X)  var  frt 

(3.2.5)  *  o     .      2  2  t_1 


b^  var  £  +  b\  JT     var  f.   +  b2(l-2x)  var  £.  _ 
^t       j=t-R    bJ  bt_R 

Since  the  denominator  of  (3.2.5)  is  by  definition  positive,  then  since  0<\<1 
it  is  evident  that  7   is  always  non-positive  and  indeed  is  strictly  negative  for 
X<1.  Any  time-series  satisfying  the  assumptions  of  the  model  defined  in  section 
3.1  is  thus  regressive,  in  the  trend-reversing  sense  defined  by  analysts  of  anti- 
cipations surveys,   as  an  automatic  consequence  of  the  presence  of  a  transient 
component  in  the  random  shocks  to  which  the  series  is  subject. 
Since  var  r"  =  1,  (3.2.5)  reduces  to 

(3.2.6)  7FR  =   .   ~  u 

FR   2(l-\>f  XT* 

so  that  7^  depends  only  on  X  and  R  and  not  on  b.  Chart  1  portrays  the  relation- 
ship between  -7FR  and  different  values  of  X  and  R.  It  is  evident  from  this  equation 
that  7FR  decreases  monotonically  from  zero  to  -l/2  as  the  transcience  of  the  random 
shocks  increases s  the  smaller  the  value  of  X,  the  more  evanescent  the  random  shocks 
and  the  greater  the  degree  of  regress iveness  in  the  resulting  time  series. 

3°3  Two  special  cases 

The  upper  bound  of  X  defines  a  special  case  of  the  model  defined  in 
section  3.1  which  is  of  particular  interest.  When  X  =  1,  then,  substituting  this 
value  of  X  in  equation  (3. 1.8), 

(3.3.1)  xt  -  xt_1  . 


See  for  instance  [10] 


-22- 

CHART  1 
RELATION  BETWEEN  -y,    \,    AND  R 
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Since  X^  =  Xt  +  X,  E[Xt  )  Xt_1.,  . ..]  =  X^^.  The  stochastic  process  defined  by  this 
special  case  is  thus  a  random  walk.  As  might  be  expected,  substitution  of  \  =  1 
in  equation  (3.2.6)  yields  y^  =   0  so  that,  for  any  A^  E(Ap)  =0.  In  other  words, 
knowledge  of  past  changes  in  X  provides  no  information  about  future  changes  --a 
statement  which  is  the  classic  definition  of  a  random  walk. 

Because  in  a  perfect  market  prices  will  be  set  so  as  to  reflect  the  expect- 
ations about  future  prices  held  by  marginal  buyers  and  sellers,  any  changes  in  prices 
(apart  from  growth  at  a  rate  equal  to  the  marginal  time-plus^risk  discount  on 

future  income)  will  reflect  unanticipated  events  that,  a  priori,  can  only  be  viewed 
as  realizations  of  a  pure  random  process.  Since  the  time  series  formed  by  the  move- 
ment through  time  of  a  price  which  is  determined  in  a  perfect  market  will  thus  be 
a  random  walk  if  expectations  are  "rational,  ,,:L  this  special  case  of  the  model  defined 
in  section  3.1  is  of  particular  interest  to  economists.  Several  authors  have  sug- 
gested, for  instance,  that  even  though  some  imperfections  undoubtedly  exist  in 
security  and  commodity  markets,  prices  in  such  markets  are  essentially  random  walks.2 
In  addition,  a  random  walk  may  also  describe  other  series  not  determined  in  perfect 
markets.  One  example,  constructed  in  terms  of  a  highly  imperfect  market,  might  be 
the  cumulative  sales  made  by  individual  salesmen  in  a  market  where  personal  con- 
tacts-; are  highly  important. 

This  condition  is  sufficient  but  not  necessary.  Expectations  need  not  be  "rational" 
in  the  Muthian  sense  of  being  unbiassed;  it  is  necessary  merely  that  any  systematic 
relationship  between  expectations  and  outcome  be  invariant  over  time. 

2 

See  for  instance  Alexander  [l],  Bachelier  [3],  and  Kendall  [35],  and  the  further 
exposition  of  this  view  in  Cootner  [15],  Houthakker  [Qa],    and  Roberts  [58]. 

3 

Random  shocks  would  in  such  a  model  represent  primarily  the  salesman's  gains  or 
losses  of  contacts. 
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The  stochastic  process  defined  by  (3=3.1)  is  of  course  an  unconstrained 
random  walk.  In  a  number  of  applications,  a  random  walk  may  be  bounded  by  absorb- 
ing or  reflecting  barriers.  A  random  walk  with  absorbing  barriers  (as  in  Russian 
roulette)  is  particularly  interesting  in  economic  applications  because  of  the 
problem  of  evaluating  the  "value"  of  ruin.  While  such  constrained  random  walks 
are  not  fully  described  by  ( 2.3.1),  y       is  nevertheless  not  affected  within 
known  barriers  which  are  sufficiently  far  away  from  most  values  of  X, . 

A  second  special  case  of  some  interest  is  defined  by  the  lower  bound  of 
\.  When  \  -   0,  then  substituting  this  value  of  X  in  (3.1.8)  obtains 

(3.3-2)  Xt  =  Xt_^ 

so  that  by  extension  X  =  X.  for  all  values  of  j.  Successive  values  of  the  time 
series  defined  by  (3»3°2)  can  thus  be  viewed  as  sample  observations  drawn  from 
an  identical  population.  That  the  values  of  these  observations  are  independent 
of  their  ordering  in  time  is  merely  another  way  of  saying  that  each  random  shock 
is  completely  transient. 

Substituting  X  =  0  into  (3.2.6)  yields  r_,  =  -  l/2  for  any  value  of  R. 

"in 

While  this  result  may  not  be  intuitively  obvious,  it  can  be  explained  by  consider- 
ing that  random  variation  in  A,  may  be  attributed  to  either  X.  or  X,  R  with  equal 
likelihood.  In  other  words,  since  A  =  £".  -  <^+  R  because  X ,  =  X    (both  £* 
and  ^+_R  being  identically  distributed  with  zero  expected  value),  E  [A,  \  A-J 
may  be  regarded  as  being,  with  equal  likelihood,  any  linear  combination  of 

See  for  instance  the  model  of  dividend  policy  cast  in  these  terms  contained  in 
Shubik  [62].  The  classical  "gambler's  ruin"  problem  is  discussed  in  Feller 
[22,  pp.  313-318]. 
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K[Ap  |^  =^t]  and  E  U^  \^  -  -  ft_R].  Since,  from  (3.2.2), 

(3.3-3)  E  [Ap  i£t]  =  -  ft 

1 
and 

(3.3A)  E  [Ap,  \   £t_fi]   =  0, 

it  is  thus  evident  that  E  [A_|A_]  =  -1/2  A^  so  that  7   =  -  l/2.  It  is  in  any  case 
clearly  evident  that  7^0;  that  if  past  change  has  been  in  one  direction  then 
future  change  over  any  forecast  horizon  will  on  the  average  be  in  the  other  direc- 
tion, and  that,  moreover,  the  larger  the  magnitude  of  the  past  change  the  more  pro- 
nounced the  typical  reversal. 

These  two  cases  provide  the  basis  for  an  alternate  definition  of  the 
model  developed  in  section  3.1.  Construct  the  following  weighted  average  of  the 
definitions  of  each  special  case  stated  in  (3»3«l)  and  (3.3«2); 

(3.3.5)        xt  =  p  xt_1  +  (i-3)  xt_1,  ofpti. 

It  is  evident  that  the  model  defined  by  (3.3-5)  is  identical  to  (3. 1.8)  for 
P  =  X.  The  "proportional  effect"  model  or  exponentially-weighted  moving  average 
can  thus  be  regarded  as  in  effect  postulating  that  a  time  series  is  a  linear  com- 
bination of  a  random  walk  and  of  a  set  of  independent  samplings  from  a  common 
probability  distribution. 

3  A  Implications  of  X   for  the  variance  of  X 

An  analysis  of  the  stability  of  the  variance  of  X  through  time  can  provide 
a  reasonably  fruitful  test  of  the  applicability  of  either  a  random  walk  or  a  set 
of  independent  identically-distributed  observations  as  a  description  of  X,  and 
in  the  process  provide  an  estimate  of  \.  From  (3.1.2)  and  the  postulated 
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independence  of  successive  r   ,  it  follows  that 
(3.^.1)  var     xt   =  (t  X2  +  1)   var  \ 

Similarly, 

(3.^.2)  var   Xt_R  =  [(t-R)  \2  +  l]     var  \ 

so  that 


(3«^.3)  var   X.   =  var  x.  _ 

w  t-R 


t  X2   +  1 


(t-R)\2  +  1 


If  X  =  0,  as  in  the  case  of  a  time  series  formed  by  successive  independent,  identi- 
cally distributed  observations,  then  the  variance  of  X  is  unchanged  over  time.  If 
X,  0,  then  var  (X)  will  increase,  though  at  a  declining  rate.  If  \  =   1,  as  in  the 
case  of  a  random  walk,  then  (3.U.3)  degenerates  to 


(3AA)  var  Xt        t  +  1 


var  X.  _       t-R+1 
"t-R 

a  well-known  property  of  a  random  walk. 

Alternatively,  attention  may  be  focused  on  the  ratio  of  the  variance 
of  ^  to  that  of  Aj,  where  Aj_  =  Xt  -  X^.  This  ratio  (which  will  be  denoted  by 
JR)  provides  a  mare  sensitive  test  of  whether  a  time  series  is  a  random  walk. 
From  (3.2.3),  it  is  evident  that 

(3-^.5)  var  ^     =  [jJlR  +  2(l-\)]  var  £j 

so  that,  for  R?-0, 

(3A.6)  Jr  =  var^R   „  X2R  +  2(1-X) 

var  /^      \  +  2  (l-\) 

For  a  random  walk,  X  =  1  so  that  JR  =  R.  The  variance  of  change  in  X  is  thus  then 
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proportional  to  the  length  of  time  over  which  such  change  is  analyzed.  For  the 
"timeless"  time  series  defined  by  the  second  special  case,  \  =  0  and  the  variance 
of  change  in  X  is  unrelated  to  the  duration  of  change.  More  generally,  if 
0<-\«l,  then  the  relationship  between  JR  and  R  is  shown  by  Chart  2  for  different 
values  of  X,„ 

Paul  Cootner  [15]  has  found  that,  for  stock  prices,  the  variance  of  price 
changes  over  periods  of  R  weeks  was  generally  significantly  different  from  R  times 
the  variance  of  changes  over  one  week.1  It  is  thus  evident  that  the  time-pattern 
of  stock  market  prices  is  not  a  random  walk  (as  hypothesized  in  the  studies  by 
Alexander  [l]  and  Kendall  [35]  cited  supra  as  applications  of  a  random  walk). 
In  addition,  he  found  significant  curvature  in  the  relation  between  J  and  R. 
While  it  may  be  that  this  evidence  substantiates  either  the  model  defined  in 
section  3.1  or  the  constrained  random  walk  suggested  by  Cootner  as  a  model  of 
stock  prices,   neither  is  sufficient  in  itself  to  account  for  changes  in  dj/dR. 
It  may  be  that  such  changes  can  be  explained  in  terms  of  a  drift  with  time  in  the 
reflecting  barriers  of  Cootner 's  model  or  in  the  expected  value  of  t     in  the  model 
of  section  3.1.  It  is  in  any  case  evident  that  the  simple  model  defined  in  section 
3.1  is  not  adequate  as  there  postulated. 

I  shall  not  pay  further  attention  in  this  paper  to  the  tests  of  observed 
time  series  suggested  by  this  discussions.  A  discussion  of  j(R)  for  the  other 
models  advanced  in  this  paper  will  be  presented  elsewhere. 


1 


The  model  defined  in  section  3.1  can  be  considered  as  having  essentially  identi- 
cal properties  to  those  of  a  random  walk  constrained  by  a  large  number  of  partly 
reflecting  barriers  defined  at  a  large  number  of  distances  from  some  "normal" 
price  defined  in  each  period.   (A  physical  analog  would  be  a  set  of  successive 
lattices  of  crystals,  in  which  each  lattice  reflects  only  a  portion  of  the  parti- 
cles which  have  passed  through  previous  lattices).  In  terms  of  a  model  of  common 
stock  prices,  the  model  defined  in  section  3.1  thus  assumes  a  spectrum  of  classes 
of  investors  in  place  of  the  two  classes  postulated  by  Cootner,  each  class  in  the 
spectrum  having  somewhat  different  opportunity  costs  of  search  and/or  different 
potential  amounts  of  information  which  might  be  unearthed  by  such  search.) 
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CHART  2 


RELATION  BETWEEN  J  AND  R 
FOR  SELECTED  VALUES  OF  \ 
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*•  A  two-stage  process 

The  simple  exponentially-weighted  moving  average  discussed  in  section  3 
is  a  descriptive  model  of  an  economic  time  series  only  if  the  following  assumptions 
are  satisfied;  (l)  each  observation  of  the  time  series  is  determined  up  to  a  cur- 
rent random  shock  by  the  effects  of  previous  random  shocks,  (2)  each  random  shock 
is  a  realization  of  a  pure  random  Gaussian  process  with  zero  mean,  and  (3)  the 
effect  of  each  random  shock  is  all  identical  for  all  subsequent  time.  In  this 
section  I  shall  relax  assumption  (2)  while  maintaining  assumptions  (l)  and  (3). 
To  do  so  I  shall  define  a  two-stage  model  of  X(t),  assuming  its  second  stage  to 
be  the  "proportional  effect"  model  developed  in  section  3.1.  The  output  of  the 
second-stage  model  is  thus  defined  as  the  sum  of  a  current  random  shock  plus  the 
persistent  components  of  previous  random  shocks,  each  shock  being  in  turn  defined 
as  the  output  of  a  first-stage  model.   In  section  fc.l,  I  shall  first  of  all  specify 
this  first-stage  model  as  a  linear  transformation  of  the  pure  random  Gaussian 
process  £ (t),  relaxing  merely  the  assumption  that  the  random  shocks  are  a  realiza- 
tion of  a  process  with  zero  mean.  Even  maintaining  all  other  assumptions  previously 
specified  about  the  random  shocks,  the  model  of  section  4.1  necessitates  a  substan- 
tial modification  of  the  conclusions  reached  in  section  3  regarding  the  behavior  of 
7FR.  I  shall  then  go  on  in  section  4.2  to  relax  the  assumption  that  each  random 
shock  is  identically  distributed  and  in  section  4. 3  to  relax  the  assumption  that 
each  shock  is  independently  distributed. 

4.1  Random  shocks  with  non-zero  mean 

Other  than  to  assume  that  a  random  shock  is  a  realization  of  the  pure 
random  process  {\  (t)}   =  b  {f  (t)}  ,  as  postulated  in  equation  (3.l.l),  the 
next  simplest  assumption  that  can  be  made  is  to  continue  to  assume  that  it  is  a 
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realization  of  a  pure  random  process  (so  that  the  elements  of  any  set  of  random 
shocks  are  independently  and  identically  distributed)  but  at  the  same  time  to 
relax  the  assumption  that  the  mean  of  the  process  is  zero.  Define  a  process 
%   %  (i)j'    such  that 

(*.1.1)  4  j  -  atb?. 

The  expected  value  of  ^.   is  then  a,  and  b2is  the  variance  of  the  process.  Since 
^j  isa  linear  transformation  os  a  pure  random  Gaussian  process,  it  is  itself  a 
pure  random  Gaussian  process.1  The  only  assumption  governing  |  which  has  been 
relaxed  is  that  £g  J  be  a  linear  homogeneous  transformation  of  CV  "I  . 

The  implications  of  this  relaxation  for  the  nature  of  the  stochastic 
process   X(t)  of  which  an  observed  time  series  is  a  realization  can  be  shown 
quickly.  Applying  the  "second-stage"  model  of  section  3  to  the  random  shocks 
defined  by  (4.1.1). 

J-it     J 

as  in  (3.1.2).  Substituting  (4.1.1)  in  (4.1.2),  the  two  stages  of  the  model  can 
be  combined  to  define  Xt  in  terms  of  the  underlying  pure  random  process  ft.\    t 
(^•1.3)  X   .  a  (1  +  xt)  +  b 


frt  +  \^    K 


j<t 


so  that  Xt,  the  conditional  expectation  of  X.  given  previous  realized  values  of 

C*-1^)  Xt  =  a  (1  +  xt)  +  b  X  £.    £  „   . 

j<t    ^  J 


1T,     . 

It  may  in some  circumstances  also  be  appropriate  to  relax  the  assumption  that 

i  ^  (j) J   is  not  only  a  pure  random  process  but  also  Gaussian  —  in  other  words, 
to  relax  the  restrictions  on  third  and  higher  moments  of  the  distribution  of  each 
^  ..  This  extension  will  not  be  made  in  this  paper. 
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It  is  evident  that  equation  (4.1.4)  does  not  reduce  to  an  exponentially-weighted 
average  of  previous  observed  values  of  X.  From  (4.1.3)  and  (4.1.4), 

Xt  -  Xt  =  b  t  and  Xt  -  Xt_1  =  X  '  t_1   =  X  [a  +  b  £+_]]•  Consequently, 

(4.1.5)  Xt  -  Xt_,  =  X  a  +  X  [Xt_x  -  Xt_x] 

so  that,  rearranging  and  iterating, 

t-1 

(4.1.6)  X   =  X   .;  (1  -  X^'J"1  X.  +  a  [1  -  (l-X)*]. 

3=0  J 


This  expression  diverges  from  the  exponentially- weighted  average  of  (3.1. 9)  by  an 
amount  which  (for  t  greater  than  say  lO/x  ),can  be  effectively  regarded  as  a  constant 
equal  to  a,  the  mean  value  of  :   „ 

The  implications  of  this  two-stage  process  for  y       likewise  diverge  from 

FR 

those  of  the  model  discussed  in  section  3.  First  of  all,  it  is  apparent  from 
(4.1.3)  that 

t+f-1 

(4.1.7)      AF  =  axF  +  b  crt+F  +  x  e    :<  -  d-x):t] 

so  that  the  unconditional  expected  value  of  A  is  aXF  rather  than  zero  as  in  the 

single-stage  model.  Because  of  the  trend  thus  introduced,  y       is  a  function  of  F 

FR 

as  well  as  of  X  and  R.   In  addition,  it  is  evident  that  7_0  is  also  a  function  of 

FR 

the  parameters  of  the  first-stage  model  defined  in  (4.1.1).  From  (4.1.3), 

t+F-1 
(4.1.8)      A^  =  aXR  +  b[;   +  X  -  (1-X)%   ] 

Substituting  (4.1.7)  and  (4.1.8)  into  (2.3),  . 

and  forming  expected  values  of  the  resultant  product  moments, 
we  obtain 
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(4.1.Q)         y      =     \  a  FR  +  b  (\-i) 

FR    ,y  2^2 ,2  -  2^ 7 — 

\  al?  +  b  [ATR  +  2  (l-\)] 

which  reduces  to  (3.2.6)  for  a  =  0.  It  is  evident  that  vm   is  in  this  model  not 

'  FR 

necessarily  non-positive.  Indeed,  since  the  denominator  is  positive,  j       is  posi- 

i  R 

tive  if  and  only  if  the  following  inequality  is  satisfied; 
(4.1.10)         X2a2   FR       b2  (l-\). 

Condition  (4.1.10)  can  be  expressed  in  the  following  way:  y       is  positive  if  X 

FR 

is  greater  than  some  A,  defined  by  the  equation 

(4.1.H)  _2 =  |  =  v  , 

where  V  is  the  coefficient  of  variation  of  .  Chart  3  shows  the  relationship 
between  \q   and  V  for  selected  values  of  the  product  of  F  and  R. 

As  is  intuitively  evident  from  the  fact  that  \2a2FR,  the  left  term  of 
expression  (4.1.10),  is  the  product  of  the  trends  defined  over  the  lengths  of  the 
forecast  and  record  horizons,  chart  3  shows  that  the  longer  these  horizons  (other 
things  being  equal),  the  greater  the  range  of  \  and  V  for  which  y   is  positive. 
The  chart  also  shows,  however,  that  the  size  of  the  range  of  FR  and  X  for  which 
'7FR  is  P°sitive  decreases  as  V  increases.   Indeed,  for  infinite  V  (as  in  the  case 
of  a  =  0  for  any  non-zero  b),  7^  is  non-positive  for  all  defined  values  of  X,  and 

is  strictly  negative  if  \  1. 

For  any  values  of  A,  F,  and  R  there  is  thus  some  V'  such  that,  for 
V  V'''  7FR  is  neSatlve»  Defining  the  degree  of  regress ivity  in  a  time  series  as 
the  degree  to  which  7FR  is  negative,   it  is  evident  that  the  degree  of  regress ivity 


This  follows  the  definition  of  regress ivity  used  [9,  p.  246 j  and  [lO]. 
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is  an  increasing  function  of  V  but  a  decreasing  function  of  \  and  FR.  Holding 
V  and  FR  constant,  the  degree  of  regressivity  in  a  time  series  is  a  decreasing 
function  of  \   alone,  as  in  the  model  discussed  in  section  3,2. 

It  may  be  useful  to  discuss  the  implications  of  the  "first-stage"  model 
defined  in  (4.1.1)  for  the  values  of  /FR  in  the  two  special  cases  of  the  "second- 
stage"  model  of  section  3  defined  by  X   =  0  and  \   =  1.   If  \  =   0,  then,  substituting 
in  (4.1.9),  7FR  =  -  1/2.  In  this  special  case  the  properties  of  7       are  thus  inde- 
pendent  of  the  parameters  of  the  "first-stage"  model.  This  is  not  true  for  the 
random  walk  defined  by  \  =  1,  however.  In  this  case  substitution  of  \  =  1  into 
(4.1.9)  yields 

(^  1.12)  7  =    aFR 

7FR     aV  +  b^R 

which  is  non-positive  only  if  a  =0.  This  result,  again,  is  consonant  with  what 
one  should  intuitively  expect.   Since  a  random  walk  is  merely  a  summation  of  pre- 
vious random  shocks,  a  non-zero  expected  value  of  those  shocks  results  in  the 
random  walk  being  defined  around  some  positive  or  negative  trend.   Consequently 
E  tXt+F  xt^  "  Xt  =  aF  so  that;  since  A^  on  the  average  equals  aR,  the  directions 
of  change  tend  to  be  the  same  over  both  forecast  and  record  horizons  --a  statement 
which  is  equivalent  to  stating  that  7__  0  for  non-zero  values  of  a. 

4.2  Non-identically  distributed  shocks 

Up  to  this  point,  it  has  been  assumed  that  the  random  shocks  described  by 
the  intermediate  process   (j)  are  independently  and  identically  distributed.   I 
shall  now  drop  the  assumption  that  they  are  identically  distributed,  first  examining 
the  case  where   (j)  is  merely  heteroscedastic,  and  then  going  on  to  examine  the 
more  general  case  where  not  only  Var   (j)  but  also  E  [   (j)]  are  dependent  on 
time. 
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4.2.1.   Heterscedastic  s(,i)   I  shall  assume  time -dependence  In  the 

variance  of  ^(j)  such  that  Var  \     =  (l  +  f)  Var  f  .   for  all  j,  where  f^-1. 

J  J-  J- 

Further  defining  Var  |  Q  =  b  Var  >   =  b2,  then,  since  Var  £  .  =  Var  £ 
(4.2.1)  Var  t.   =  b2  (l+f)j  Var  £   . 

Consequently,  defining  as  before  E( £;  .)  =  a, 

J 

C^.2.2)  ^    =  a  +  b  VH+f?  f  , 

Substituting  this  definition  of  if.  in  equation  (4.1.2),  the  combined  two-sta 
process  thus  becomes 


J  §e 


(^-2-3)  Xt     =     a(l+\t)  +  bt^yflrf)1     +     x    S     ^.^l+f)1"]     , 

which  reduces  to  equation  (1+.1.3)  when  f  =  0,      It  can  easily  be  shown  that 

(4.2.4)  7       =  ^^m  +  ^(X-lJCl+f)* 

\'a¥  +  b*[S] 

where     S     =   (l+f )*  +  \2  g       (l+f)J   +   (l-2\)(l+f )t_R 

j=t-R 

This  equation  of  course  reduces  to  equation  (4.1.9)  if  f  =  0.   It  is  evident  that 
the  degree  of  regressivity  in  X  is  related  to  f .  Specifically,  it  is  clear  that 
7FB  is  P°sitive  only  if  ^  is  greater  than  some  \     ,  where  \       is  larger  than  the 
\Q   defined  in  equation  (4.1.11)  if  f?»0  but  smaller  than  \  if  f^o.  As  in 
section  4.1,  7FR  is  non-positive  for  all  defined  values  of  \  and  f  if  a  =0. 

4.2.2.  Time  dependent  mean;  Case  I  If  %  not  only  is  heteroscedastic 

but  also  has  an  expected  value  which  drifts  with  time,  it  is  necessary  to  further 

modify  earlier  statements  about  the  behavior  of  t_.  I  shall  examine  y       for  two 

FR  'FR 

different  models  of  time-drift  in  f :  assuming  in  the  first  case  that  E  ( f  )  =  a  +  ct 
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and  then  in  the  second  case  that  E(  \,  )  =  a  +  d  log  (t+l).  Heteroscedasticity 
vill  be  defined  as  in  section  4,2.1. 

In  Case  I,  it  is  assumed  that  there  is  a  constant  linear  drift  in  s  so 
that  E(^  .  -  ^    )  =  c.  Defining  E(  £n)  =  a,  then  E(  £  .)  =  a  +  cj,  so  that,  from 
this  definition  and  from  (4.2.1), 


(4.2.5)  ^.        =  a  +  cj  +  b  V(l+f)J  £j  • 
Substituting  (4.2.5)  in  equation  (4.1.2), 

(4.2.6)  Xt  =  a(l+\t)  +  ct[l  +  |  (t-1)]  +  b  Vl+f)*  £ 

Consequently,  deriving  7   as  before, 

FR 


(*.a.T)     rFR  -  *R[L]  +  b2(x-i)(i+f f 
FR     b2[s]  +  r2Ek] 

where     L  =  a2\2  +  \ac[2(l+Xt)  +  \(^r  -  l)] 

+  c2   [1  +  X(t  +^i)][l  +  \(t   -  2±i)] 

S   =   (1+f)*  +  X2    22      (l+f)J   +   (l-2x)(l+f)t"R 
j=t-R 

K  =  a\2  +  2Xac    [1  +  \(t   -  5+i)]   +  c2[l  +  \(t   -  5+i)] 


If  c   =  0,    equation  (4.2.6)   reduces  to   (4.2.3)  and  equation   (4.2.7)   reduces  to 
(4.2.4).     As  before,    7        is  positive   if  and  only  if  the  numerator  of   (4.2.7)   is 
positive.     Consequently  7__,  is  positive  if  and  only  if 
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(4.2.8)  A^R  +'cFB[w]     >      b2   (l-xXl+f)* 

2 
2    i/ua   +   t:   +   ca.  -   2J    "  r 


where   W  =  (F-R)  5  [Xa  +  c  +  cX  -  £]  -  £  FR 


+  t\2  [2a  +  |2.  +  c(t-l)] 
A, 

+  X  (2a  -  Xa  -  c  +  £  )  +  c 

This  condition  can  be  written  in  the  following  way:   7^  is  positive  if  and  only 

■» 
0 


'FR 
if  X  is  greater  than  some  Xn   defined  by  the  equation 


(^.2.9)         i\**f   FR  -  -%  FR  [W]   =  ^  (l+f  )t  (l-X  **) 

2a  a 

where  W  is  defined  in  (4.2.8). 

Comparing  this  condition  to  the  one  discussed  in  subsection  4.2.1  for 
series  which  are  formed  from  random  shocks  that  are  time-dependent  only  in  their 
variances,  it  is  clear  that  of  course  \         =  \    if  c  =  0.  For  non-zero  values 

of  c  the  comparison  is  not  so  clear,  however,  for  Xn   may  be  larger  or  smaller  than 

* 

XQ  depending  on  the  values  of  a  and  c  and  of  the  time-parameters  F,  R,  and  t.  The 

reason  for  this  is  apparent  enough:   in  the  general  case,  it  cannot  be  specified 
whether  the  time -dependent  and  non-time-dependent  trends  defined  respectively  by 
c  and  a  reinforce  one  another  or  the  opposite.   If  they  reinforce  one  another,  the 
effect  will  be  to  make  7pR  larger  than  it  would  be  in  the  absence  of  the  time- 
dependent  trend.  If  they  counter  each  other,  the  marginal  effect  of  the  time- 
dependent  trend  is  to  increase  the  regressivity  of  the  series. 

It  is  possible  to  make  more  precise  general  statements  in  the  special 
cases  defined  by  X  =  0  and  \  =  1.  If  X  =  0,  then, substituting  this  value  of  X  in 
(4.2.7), 
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,.       "  FR  c2  -  b2   (l+f)* 

(it. 2.10)  7_  = 

FR     b2[(l+f )*  +  (l+f )*"»]  +  R2c2 

It  is  thus  evident  that  7pR  is  equal  to  -  1/2  for  all  values  of  a,  b,  t,  F,  and  R 
only  if  c  =  f  =  0.  If  only  c  =  0,  then  7pR  is  still  strictly  negative,  but  can 
take  on  any  values  in  the  open  interval  between  -  1  and  zero  as  indicated  by 

(4.2.11)  rFR  _   _     (i+f)R 

d+f  )R  +  i 

If  c  is  also  non-zero,  it  is  evident  from  (4.2.10)  that  7__  may  even  be  positive. 

FR 

Time  dependence  in  the  random  shocks  thus  substantially  affects  the  behavior  of 
7^3  in  this  case. 

Turning  to  the  second  boundary-defined  special  case,  equation  (4.2.7) 
reduces  to  the  following  expression  when  \   =  1: 

FR  [a2   +  ac  [ 2(l+t)+  ^  -l)3  +  c2[l+t+  Zli][i+t.5+l]} 


b2  £    (l+f )j  +  R2[a2  +  2ac  (l+t-  *£)  +  c2(l+t-*±i)2] 
j=t-R+l  *  A 

The  most  obvious  point  that  can  be  made  about  the  range  of  7__.  defined  by  this 

FR 

equation  is  that  of  course  it  reduces  to  that  defined  by  (4.1.12)  if  c  =  f  =  0. 
It  is  clear  that  7pR  is  not  necessarily  non-positive  if  a  =  0.  If  a  =  0,  then 
(4.2.12)  reduces  to 


,.     .              FR  c*(l+t+  ^i)(l+t  -  £i±) 
C^.2.13)      7FR  =  — ^ Z. 1 

b2   £     (lff)J  +  c¥(l+t^)2 
j=t-R+l 

so  that  7   is  in  this  case  non-positive  if  and  only  if 


2,n  .  .  F-lx/n    R+l\   ^ 


(4.2.14)      PRc(l+t+^i)(l+t-^)   ^   0. 
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Since  t>R,  this  condition  is  satisfied  only  if  c  =  0.   7   can  thus  be  positive 
even  when  a  =  0^  as  one  might  expect  from  the  fact  that  any  trend  around  which 
the  random  walk  is  defined  has  the  same  sign  in  both  A_  and  Ap  no  matter  whether 
or  not  the  trend  is  time-dependent. 

4.2.3  Time -dependent  mean:  Case  II   It  may  for  some  series  be  more 
realistic  to  assume  the  existence  of  a  decreasingly- important  drift  in  \    rather 
than  the  constant  drift  postulated  in  subsection  4.2.2.  One  model  of  drift  which 
satisfies  the  assumption  of  a  constantly  decreasing  expected  value  of  change  in 
5  is  that  obtained  by  specifying  that  E  [  ^ .  -  £.   ]  =  d  log  [(j  +  l)/j]  . 
Specifying  this  model  and  further  specifying  as  before  E  (  £n)  =  a  ancl  hetero- 
scedasticity  as  in  (4.2.1),  then  E  (  ^  . )  =a  +  d  log  (j  +  l)  and 


(4.2.15)  %,      =  a  +  d  log  (j+1)  +  b  V(l+f)j  > 
Substituting  this  equation  in  (4.1.2), 

(4.2.16)  Xt  =  a(l+\t)  +  d[log  (t+1)  +  \  ^t  log  (j+1)] 


+  bVo+7^^t  +  xb  „§t  y^f^. . 


Comparing  this  model  with  that  defined  in  (4.2.6),  it  is  evident  that 
the  only  difference  between  the  two  models  is  the  lesser  importance  of  the  time- 
dependent  trend  in  (4.2.16).  The  behavior  of  7   will  consequently  correspond  to 
that  described  in  subsection  4.2.2,  subject  to  this  qualification. 

4.3.  Interdependent  shocks 

To  generalize  the  first- stage  model  thus  far  developed  to  allow  for  the 
possibility  of  interdependency  in  the  random  shocks  generated  as  inputs  for  the 
second-stage  model,  I  assume  an  additive  difference  between  the  unconditional 
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expected  value  of  £  ,  E(  t .),    and  the  ejected  value  of  V,  given  the  values  of 
preceding  shocks,  E( ^  .  j  ^  ^  ^   , ...).  I  shall  postulate  as  in  section  4.2.2 
that  E(  £  )  =  a  +  c  ,  and  shall  further  postulate  the  first  order  dependence  in 
\   defined  for  j  7  0  by 


(4.3.1)        E 


( h  I  h-v  *  j-2--°  =  a  +  CJ  +  s[  fj-i  - E  ( W] 


If  g  =  0,  this  conditional  expected  value  reduces  to  the  unconditional  expectation 
E  (  |  ).  If  g-cO,  then  the  series  formed  by  the  deviations  of  the  shocks  from 
their  trend  is  regressive;  if  g^O,  the  opposite  is  true.  An  interesting  case  is 
defined  by  g  =  1;  in  this  case  \        is  defined  as  a  random  walk  around  the  trend 

J 

defined  by  the  parameters  a  cna  c  since  (4.3.1)  then  reduces  to 

Further  postulating  that  the  variance  of  X  .   is  defined  by  (4.2.1)  if  g  =  0,  it  is 

J 

evident  from  (4.3.1)  that 

C^-3-3)  \  ,   =  a  +  cj  +  b  Y(l+f)J  £ „  +  g[  |._x  -  a  -  c(j-l)] 

for  j?0  so  that,  iterating  within  the  right-hand  term  of  (4.3.3), 
(^.3.4)  ^  =  a  +  cj  +  b  £   g1  Vd+f)^1  > 

for  all  j.  It  is  of  course  possible  to  assume  more  complicated  interdependence  in 
the  §  ,  but  I  shall  restrict  myself  in  this  paper  to  that  defined  by  (4.3.1). 

Substituting  (4.3.4)  in  the  second-stage  model  defined  by  (4.1.2),  the 
combined  two-stage  model  thus  becomes 


4l- 


C^.3.5) 


Xt  =  a  + 


ct  +  b  |   g1  ]£^"i  ^ 


t-i  r 


+  x 


J=o 


a  +  cj 


i=0  V~x 


Since 


t-1   j 


(^.3.6)    £ 


t-i  t-j 


1=0  ^J       i=0  1=0  s 


we  can,  after  closing  sums,  rewrite  (^.3.5)  as 


(^•3.7)  xt  =  a(l+\t)  +  ct  [1  +  |  (t-1)]  +  b  Yl+f)t   f . 


t-l 


b  £     [gt_j  +  t\  (i+g^^1)]     7^flJ    £  . 
j=o  ±_s  '  *»  J 


Consequently,  proceeding  in  the  usual  manner  to  find  j. 


FR' 


(>.3.8)  7m  ■    fflfc]  +  b   fe]  . 

IT    [V]   +  R^h] 


where  L  =  a2\2  +  Xac  [2(l+\t)  +  X  (^  -  l)] 


+  c2   [1+   X  (t  +  Fzi)][i  +  x  (t   -  2+i)] 


F         X 


F+l, 


t 


[gr    +  ^  (W+±)    -   1]    (1+f ) 

+     £       Eg*-"5  -  ^(i^-^JlCCg^-J  -  g^Jd  +  *-)] 

j=t-R+l  X  g  1_g 

+    [gR  +  ^d^1)    -   l][(gF+R  -   gR)(l  +  ^)](l+f) 

j=o  1_e 
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K  =  a\2  +  2Xac   (i+x[t  -  2+i])  +  c2  (l+x[t  -  ^]) 

V  =  (l+f)t  +     £X     [gt-J  +     X     (l+g*-^1)]2  (itf)J 

j=t-R+l  ■L~g 

R  .,  _A_  /-,      R+l\  ,i2   ,„    ^t-R 


+  fe  +  i~  a+s    )  -  ir  (iff) 

+  ~l~]     [(gt_j  -  g^-Jjd  +  M-)]2  (i+f)J 
j-o  x-s 

The  equation  is  somewhat  formidable  in  appearance.  Nevertheless,  it  is  different 
from  equation  (4.2.7)  only  in  the  expressions  Z  and  V,  which  are  themselves  func- 
tions only  of  t,    g,  \,    and  the  time  parameters  F,  R,  and  t .  The  behavior  of  7 

'FR 

with  respect  to  changes  in  a  and  c  is  thus  qualitatively  no  different  from  that 
described  in  sections  3,1  and  3.2. 

The  effect  of  the  interdependence  introduced  by  non-zero  values  of  g 
can  be  pinpointed  by  examining  the  behavior  of  7pR  with  respect  to  g  in  the  special 
case  where  no  trend  is  posulated  in  *%..       If  a  =  c  =  0,  then  (4.3-8)  reduces  to 
7FR  =  Z/V>  where  z  and  V  are  defined  as  above. 
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5.  An  alternate  second-stage  model 

Up  to  this  point,  it  has  "been  assumed  that  the  effects  of  random 
influences  occurring  in  a  given  period  can  be  dichotomized  into  two  parts,  con- 
sisting of  those  effects  which  persist  beyond  the  given  period  and  those  which 
do  not-  The  persistent  effects  of  random  influences  are  defined  to  be  truly 
persistent,  in  that  a  random  shock  is  assumed  to  have  the  same  effect  on  the 
realized  variable  for  all  subsequent  time. 

Since  it  may  be  desirable  to  regard  the  time-pattern  of  the  effects 
of  a  random  shock  as  other  than  the  simple  dyadic  specification  thus  far  dis- 
cussed, an  alternate  second  stage  model  is  analyzed  in  this  section.  The  model 
itself  is  presented  in  section  5«1«  The  behavior  of  7^,  given  this  model  is  then 

IK 

analyzed  in  section  5.2  and  in  section  5.3° 

5.1  Pascal-distributed  effects 

The  essence  of  the  second  stage  of  the  two-stage  model  which  has  been 

presented  in  this  paper  is  a  specification  of  the  nature  of  the  effect  of  a 

random  shock  occurring  in  any  period  upon  the  level  in  that  and  other  periods 

of  the  variable  being  analyzed.   (The  stochastic  process  generating  that 

random  shock  is  specified  by  the  first-stage  model).  To  put  this  statement 

another  way,  let  me  use  the  notation  F.  ,  (  ^.)  to  denote  the  effect  in  period 

j  +  k  of  a  random  shock  £.  occurring  in  period  j .  The  model  specified  in  section 

J 

3  can  then  be  formulated  in  the  following  way: 


t5-1-1)  »j*<*j>-ij'k-° 


\  |.  ,k  =0,  0£\^1. 
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In  many  circumstances  it  may  seem  inappropriate  to  assume  that  the 
effect  of  a  random  shock  is  identical  in  all  subsequent  periods  or  --  the  same 
thing  --  that  the  sum  of  the  effects  of  a  random  shock  on  all  subsequent  periods 
is  non-finite.  One  alternative  is  to  postulate  that  the  effect  of  a  random  shock 
continuously  declines  as  time  goes  on.  A  convenient  specification  of  this  alter- 
native is  the  exponential  decay  defined  by 

Specifying  F,  (|  )  =  (l-S  )  %  .,    it  follows  from  (5.1.2)  that 

(5.1.3)      F3+k  (^1.(1-5)5^  j 

for  k  70. 

The  model  described  by  this  equation  is  similar  to  that  defined  by  (l+.l.l)  in 
that  the  special  cases  of  each  defined  by  5  =  0  and  \   =  0  are  identical.  In 
other  respects  it  is  quite  dissimilar.  In  particular,  it  can  be  easily  shown 
that,  since  §  •<  1, 

(5»1A)  lim    £_      F    (|  )   =  % 

n-*  <»   k=0   J      J        d 

so  that  there  is  a  finite  upper  bound  on  the  sum  of  all  effects  of  the  shock. 
The  model  expressed  by  (5.1.2)  can  thus  be  viewed  as  an  allocation  over  subseq- 
quent  time  of  the  random  shock  £  „ . 

The  operational  definition  of  £  implied  by  this  model  is  substantially 

«J 

different  from  that  implied  by  the  proportional  effect  model  defined  in  (5.1.1). 
In  that  model,  the  random  shock  ^  .  was  defined  as  its  effect  in  period  j .  By 
contrast,  V  .  is  defined  in  the  exponential  decay  model  as  its  total  effect  in 
all  periodsj   its  effect  in  period  j  is  merely  (l-S  )  *\   .»  These  two  definitions 

J 


coincide  only  for  the  special  cases  where  g  and  \  equal  zero.  As  £  increases 
the  definitions  diverge.  Indeed, 

(5.1.5)  11m    F.  (£,)  =  lim    F  ..("§.)=  0 
§  -»  1   J    J     5  -*  1   J+k    J 

so  that  the  definitions  would  infinitely  diverge  if  §  were  permitted  to  equal  unity. 

The  fact  that  F .  ( V  )  asymtotically  vanishes  as  5  approaches  unity  merely  means 
J    J 

that  the  ratio  of  the  total  effect  of  ^ .  to  its  effect  in  period  j  increases  indef - 

J 

initely  as  §  goes  to  unity  --  or,  to  put  it  differently,  as  the  observed  time  series 
to  be  defined  by  (5. 1.3)  approaches  a  random  walk. 

As  this  statement  implies,  the  structure  of  the  time  series  defined  by 
an  exponential  decay  model  approaches  that  of  the  special  case  of  ( 5.1.1)  defined 
by  X  =   1,  as  S  approaches  unity.  However,  this  convergence  is  only  readily  appar- 
ent for  values  of  S  close  to  unity.  The  difference  between  the  time  pattern  of 
the  effects  of  \  ,   specified  by  (5.I.I)  and  that  specified  by  (5.1.3)  may  be 

J 

defined  as  the  difference  between  the  values  of  the  ratio  F  „  ,  (  \  .  )/F ,   (  £•  . ) 
in  each  model.  Chart  k   shows  values  of  this  difference  corresponding  to  various 
values  of  X,  and  §  .  Since  this  difference  is  simply  the  value  of  (\  -  8    ),  it 
is  evident  that  this  difference  increases  virtually  until  the  "random  walk" 
special  case  is  reached. 

The  exponential  decay  model  specified  by  (5«1.2)  is  interesting  among 
other  reasons  because  it  reduces  to  a  Markov  process.    Defining  X,,  as  before, 
as  the  sum  of  a  current  random  shock  and  the  persistent  components  of  previous 
shocks, 

(5.1.6)  x.  =  (i-s)  *t  +  £  (1-5)  t_J  X. 

*        *   j-o         J 

so  that 

(5.1.7)  xt  =  o  xt_1  +  \x   (1-S) 


There  is  a  voluminous  literature  on  Markov  processes.  For  surveys  of  the  statis- 
tical theory,  see  Billingsley  [6] [7]  and  Feller  [22  ,  chapters  15  and  16].  Some 
economic  applications  (to  marketing  problems)  are  discussed  in  Harary  and  Lipstein 
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CHART  k 


DIFFERENCES  BETWEEN  VALUES  OF  F    (  £  )/F  (  t    ) 
FOR  " $ "   AND  "\"  MODELS 


A-S 


«   A 
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The  observed  time  series  is  thus  by  this  model  a  first-order  Markov  process  so 

long  as  the  random  shocks  %    are  independently  and  identically  distributed. 

J 

In  a  number  of  applications,  it  is  unsatisfactory  to  have  to  assume 
that  the  greatest  effect  of  a  random  shock  is  at  the  time  of  its  occurrence . 
The  effect  of  random  shocks  may  take  time  to  build  up  to  a  maximum  before  begin- 
ning to  tail  off.  Consequently  it  would  be  useful  to  specify  a  more  general 
model  of  the  time  pattern  of  random  shocks :  effects . 

As  Robert  Solov  [58]  has  pointed  out,  a  natural  generalization  of 
(5. 1.3)  exists  by  virtue  of  the  fact  that  the  successive  ratios  (l/%.)[F.  ,("?.)] 
are  terms  of  the  geometric  probability  distribution,  a  special  case  of  the  Pascal 
distribution.   Consequently  we  may  write 

where  r  is  a  positive  integer.  This  expression  reduces  to  (5.1.3)  for  r  =  1. 
Since,  from  the  binomial  theorem, 

(5,1.9)  iim   -t    ("t"1)  s k  =  d-srr, 

n~>°°    k=0   \    ' 

it  is  evident  that 

n 


(5  =  1.10)        lim   2T   Fi+k  (  %>)     =     * 


n-»  °°  k=0   d"""    °  "  J 

as  in  (5.1.4).  The  Pascal  distribution  thus  provides  a  general  specification 
of  the  distribution  of  the  total  effects  of  each  random  shock  over  subsequent 
time. 

The  difference  between  the  Pascal  distribution  defined  by  different  values 

The  Pascal  distribution  is  discussed  in  Solow  [64,  pp.  394-396]  and  Feller 
[22,  pp.  155-1573"  This  distribution  can  be  regarded  as  the  result  of  r 
exponential  decay  model  cascaded  in  series j   it  is  so  discussed  in  Forrester 
[24,  chapter  9].  See  also  Theil  and  Stern  [68,  pp.  113-1153. 
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of  r  is  illustrated  in  chart  5>  which  presents  an  indication  of  the  time  pattern 
of  effects  associated  with  various  values  of  r.   For  convenience  in  presenta- 
tion F  ,  (  £  „)  is  defined  as  a  continuous  function  of  k.  The  axes  of  chart  5 
J+k   7  J 

are  defined  so  as  to  collapse  the  family  of  distributions  defined  by  different 
values  of  &   into  a  simple  distribution  for  each  given  value  of  r. 

As  chart  5  indicates,  the  maximum  effect  per  period  of  a  random  shock 
moves  outward  as  r  is  increased.  For  r  =  1,  the  maximum  effect  of  a  shock  is 
in  the  period  in  which  the  shock  occurs .  For  r  =  2,    the  immediate  effect  of  a 

random  shock  is  zero,  at  which  time  the  rate  of  change  of  F.  ,  (  ^  • )  is  a"k  a 

2 
maximum.   For  r  ?-2,  both  the  immediate  effect  of  the  shock  and  its  rate  of 

change  at  that  time  are  zero.  There  seems  little  point  in  postulating  r^3 
for  most  economic  applications,  since  the  qualitative  nature  of  the  time-pattern 
of  delayed  effects  is  specified  by  any  r^2.  For  larger,  the  resultant  distri- 
bution becomes  both  less  skewed  and  more  leptokurtic;  the  length  of  time 

before  F,  ,  (  f  . )  reaches  a  given  value  increases  (though  at  a  decreasing  rate), 
j+k    j 

the  rate  of  increase  once  a  significant  level  of  F.  ,  ( £  . )  is  reached  becomes 

e  j+k   ?  y 

larger,  and  the  rate  of  approach  to  zero  of  F    (  ^  .)  is  faster  once  the  model 
value  of  the  distribution  is  passed. 

Given  the  specification  of  F  (£  „)  defined  in  (5.1.8),  the  second- 
stage  model  can  be  quickly  obtained  as  the  sum  of  the  currently  persistent  com- 
ponents (i.e.,  the  current  effects  in  period  t)  of  both  current  and  previous 


Strictly  speaking,  this  is  of  course  not  true  for  the  Pascal  distribution, 


1  Chart  5  is  adapted  from  [2k,    Figure  9-7] 

2 

for  which  F.  (  £  ,)/(!-  S  )  | .  is  1  f  or  r  =  1,  (1-5  )  for  r  =  2,  (l-S  )'£ 

for  r  =  3  and  so  forth.  The  continuous  presentation  of  chart  5  provides  only 
an  approximate  representation  of  the  Pascal  distribution.  The  remarks  supra 
on  the  form  of  distributions  defined  by  different  values  of  r  are  consequently 
only  qualitatively  correct. 
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CHART  5 

THE  TIME  PATTERN  OF  EFFECTS  OF  RANDOM  SHOCKS 
DEFINED  BY  VARIOUS  VALUES  OF  r 


W«i> 


k(l-S) 
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random  shocks . 

Thus,  since  k  =  t  -  j, 

(5.1.11)  xt-  e    (rtr)  fc'-sfs*"1  |j  • 

The  second  stage  model  is  thus  defined  in  the  two  parameters  §  and  r. 

In  order  to  bring  out  the  implications  of  this  model,  I  shall  first 
analyze  7   in  the  case  where  r  =  1  and  then  turn  to  cases  where  r  *■!.     Since 
the  marginal  effects  of  specification  of  the  first-stage  model  as  in  sections 
k,2   and  4.3  should  not  differ  in  their  qualitative  aspects  when  applied  to  either 
second-stage  model,  I  shall  not  relax  the  assumption  that  the  ^„  are  a  pure  ran- 
dom process. 

5.2  Behavior  of  7_,n  when  r  =  1 
an 

So  long  as  |  is  a  pure  random  process,  the  behavior  of  7   can  be 

easily  determined  from  the  fact  that  X„  is  then  a  Markov  process.  As  is  well- 

J 

known  (as  for  instance  Bartlett  [4,  pp.  l44-l45]),  the  autocorrelation  function 

associated  with  a  Markov  process  defined  as  in  (5°!°7)  is  a  decreasing  exponen- 

1 
tial,   so  that 

(5.2.1)  1^  =  (RX)L  =  SL  . 

Consequently,  substituting  in  (2.5 ), 

,    rnF   „R    F+R-i 

(5=2.2)  7fr  -   ^  CS  +g  -S   3  . 

2[1  -SR] 

Hence  7   is  negative  for  0  ^ &  £-  \}    as  would  be  expected  from  the  fact  that  the 
FR 

Markov  process  defined  in  (5.1. 7)  is  stable.  As  would  moreover  be  expected  from 
the  identity  of  the  two  special  cases  defined  by  £   =  0  and  X  =  0,  7   is  equal 

Bartlett  proves  this  for  the  case  where  %   „  =  b  £"  ;   it  is  however  an  easy 

extension  to  show  that  this  result  is  also  valid  for  5=a+bf,,  a^O- 

J        ^  J 
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to  -  1/2  when  S  =0.  Perhaps  more  surprising  is  the  fact  that  7   converges 
to  -  l/2  for  any  (S^l  as  F  and  R  are  increased. 

The  relationship  between  7   and  R  for  various  values  of  S  when  F  =  R 

IK 

is  presented  in  Chart  6.  This  chart  affords  some  interesting  comparisons  with 

the  form  of  7(F,R)  defined  by  the  analogous  model  of  Section  3-l«  For  the  case 

where  a  =  0,  the  differences  in  7(F,  R)  are  brought  out  by  comparing  Chart  6  with 

Chart.  1.  As  is  clear  from  such  a  comparison,  the  differences  are  substantial. 

For  the  model  of  section  3.  v-c-o   goes  to  zero  as  R  increases  for  X^O,  while  for 

the  Markov  process  7   goes  to  -  l/2  for  S  ^~   1  as  F  and  R  increase.  This  differ- 

FR 

ence  should  be  evident  from  the  fact  that,  from  (5°1°3)> 


(5.2.3)  11m    F    (  fe  )  =  0 


The  Markov  process  specification  increasingly  approaches  that  of  the  section  3 
model  for  X  =  0  as  F  and  R  are  increased,  as  a  result  of  the  increased  evanes- 
cence of  random  shocks  specified  by  {5.2.3). 

For  non-zero  a,  the  difference  between  the  two  models  is  even  more 
marked.  As  shown  in  Section  4.1,  the  degree  of  regress ivity  in  a  series  defined 
by  (4.1.2)  for  a  given  value  of  \  decreases  as  \&\   increases.  Indeed,  for  some 
combinations  of  \s   F,  R,  a  and  b,  7__,  is  positive.  By  contrast,  t_  is  unchanged 
in  the  Markov  process  model  as  a  departs  from  zero.  This  will  be  shown  by  com- 
puting 7   directly  in  the  usual  manner,  rather  than  from  the  autocorrelation 
function.  From  (5.1. 11),  it  is  evident  that 

(5.2A)  x,  =  a(i  -St+])  +b  (1  -8)     E   £t_J  > 

t  j=0       Sj 

so  that,  obtaining  A„  and  ZV  from  (5.2.4),  substituting  in  (2.3).»  taking  expected 
values  of  product  moments,  and  closing  sums, 
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CHART  6 


RELATIONSHIP  BETWEEN  y     ,    R,  AND  8 


FB' 


R=  IO 


R  =  5" 


A. 


.2    .3    ,4-   ,5    .4   .7    ,8   .*5 


PR 


A 


■I 


.2 


B. 


-I ! I I 1 I I ! I L_ 

2     4     (o  8  tO         IZ    14-    It    18    20 
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52H  S2t+2  +  b2H  (jJs) 


(5.2,5)  7 


2R, 


2R 


— o—  (1-5")  +S~  -5 
5  -1 


2t+2 


FR 


W 


^  2t+2    2/1-S' 
C       +  b 


(i+sj 


'   C2R   S  -1   s-2E   <-2t+2 


where  H  =  (§F  -  1)   S.  -1 

<$R 

Since  0  #  £ -£  1,  plim&t  =  0  so  that  (5.2.5)  reduces  to  (5.2.2).  Though  this 
t£T 

result  may  seem  surprising^  it  follows  from  the  strict-sense  stationarity  of  a 
Markov  process  defined  such  that  S   <■     1. 

If  there  is  a  constant  linear  drift  in  E  ^ ,  as  discussed  in  section 
U.2.2.,  then  the  resulting  process  is  no  longer  strict-sense  stationary  and  is 
Markov  only  around  the  trend  consequently  defined  in  X.  Since  from  this  defini- 
tion ^   =  a  +  cj  +  b  >  .,  then,  substituting  for  ^   (5.1. 11), 

X, 


(5.2.6)      x+  =a(i+st+1)  +  ct  (1  -  S(1-^    +b  (i-s)  is*"1!;; 

t(l-S)  5=0 

which,  as  before,  reduces  to 


(5.2oT; 


X  =  a  +  ct  +  b  (1  -5  )  Z   8    X    '* 


j=0 


for  large  enough  values  of  t.  Computing  7_.  from  (5.2„7)  in  the  same  manner  as 


before,  it  is  evident  that 

2 2 

c  FR  -  b 

(5.2.8) 


'FR 


(SF-D(SR-D~^ 

I2R2  +  2b2  (1  -&R)  ±JL 


If  c  =  0,  (5.2.8)  of  course  reduces  to  (5.2.2).  For  non-zero  values  of  c,  however, 


it  is  apparent  that  7   may  be  positive  if 


(5.2.9) 


c2fr  ^  b2(5F  -  i)G5R  -  1)  ±-^8 


&' 
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as  would  be  expected  from  the  fact  that  the  unconditional  expected  values  of 
future  and  past  change  are  non-zero.  In  particular  KA  =  cF  and  EAR  =  cR. 
Equation  (5. 2 .8}  thus  represents  the  ratio  of  EA  to  ka  modified  to  allow  for 
the  variance  in  A„  and  A„  around  their  expected  values. 

5.3  Behavior  of  y—.   when  r>-l 
^ rK 

The  nature  of  j(F,R)  for  the  model  defined  by  any  positive  value  of 

r  can  perhaps  best  be  analyzed  by  comparing  it  with  the  nature  of  7(F,R)  for  the 

special  case  defined  by  r  =  1  and  discussed  in  the  previous  section.  Defining 

£ .  =  a  +  b  £.  +  cj,  as  in  the  previous  section,  (5»l.ll)  can  be  rewritten  as 

t 
(5.3.1)     X .=(l-SfZ    D(t-j)St_J  [a  +  *>  +  cj] 

*  j=0  ^J 


where  D(k)  =  ft*'1) 


If  r  =  1,  D(k)  =  1  for  all  values  of  k  and  (5°3.l)  reduces  to  (5.2A). 
In  general, 

For  any  value  of  r,  D(k)  =  1  if  k  =  0.  If  k-^0,  D(k)  t-  1  for  all  r  ^  1,  and, 

in  particular, 

(5.3.3)  D(k+1)  =(|~f)  D(k)   . 

It  is  apparent  from  (5. 3. 3)  that  both  D(k+l)/D(k)  and  the  ratio  of  D(k+l)/D(k) 
to  D(k)/D(k-l)  approach  unity  as  k  increases  for  r^-l  and  that  both  ratios  approach 
unity  for  a  given  k  as  r  approaches  unity  from  above.  Thus,  for  any  given 
(positive)  value  of  k,  the  extent  to  which  D(k)  exceeds  unity  is  increased  as  R 
is  increased.  Likewise,  for  any  two  given  non-equal  values  of  R,  the  differ- 
ence between  D(k)  for  the  higher  R  and  D(k)  for  the  lower  r  increases  as  k  is 
increased. 
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All  these  facts  make  it  easy  to  pinpoint  the  difference  between  models 
resulting  from  differently  specified  values  of  r.  Obtaining  A_  and  Ap  from 
(5.3«l)  and  obtaining  7   in  the  usual  manner,  it  turns  out  that 


(5.3A)     r 


FR 


i2  EG  +  ac(EK  +  Gj)  +  c2  JK  +  b2   (M  +  N) 
a     G     +  2acGK  +  c  K     +  b     Q 


t  t+F  ,     „    , 

where  E   =      £     [D(t+F-j )  V    -  D(t-j )  35 t_J   +     £       D(t+F-j)S  J 

j=0  j=t+l 


t-R 


biR  [D(t-j)  -  Drills*- j  +    £     DCt-jjs*--1 


j=0 


SR 


j=t-R+l 
t+F 


£     j[D(t+f-j)SF   -  D(t-j)J5t_J   +     Z.        J   D(t+F-j)S 
j=0  j=t+l 


t+F-j 


t-R 


t    ' 


E.   J  tD(t-j)  -  P^-g-J/jS1^  +    £        D(t-j)5"t_J 

j=0  §   R  j-t-ft+1 


t-R  „ 

M=      £       [D(t+F-j)SF  -  D(t-j)] 
j=0 


"D(t-j)]-2^|=ii]5a(t-J) 


N 


j=t-R+l 


[D(t+f-j)SF  -  D(t-j)]  D(t-j)S2   (t~j) 


t-R     r 


Q  = 


j=0 


t-j)  -  2fc£Lil 


52(t-J)+     £    [D(t-j)]252(t-j) 

J-fc-H+1 


If  r  =  1,    then  D(k)   =  1  for  all  k  and   (5»3«i*-)  reduces  to  (5.2„5)„     To  evaluate 
(5»3»*0  for  r?l,   we  can  rewrite  E  and  G  as 


t+1 


F-l 


(5=3.5)         E  =D(t+l)5L+X  % 

k=0 


D(t+l+k) 
D(t+1) 


h        C±     D(t+1) 


1  -  sJ 


1  -  s 
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R-i 
(5.3.6)  G  =D(t-R+l)St-R+1  £ 

k=0 


D(t-R+l+k) 
D(t-R+1) 


Sk  ~  D(t-R+l)St_R+1 


1  -s 


where  the  approximations  follow  from  the  asymptotic  behavior  of  D(K+l)/D(k) 

described  above.  It  is  clear  from  this  same  asymptotic  behavior  that  the  product 

t-R+1 
EG  will  be  arbitrarily  close  to  zero  if  [D(t-R+l)§     ]  is  arbitrarily  close  to 

zero.  It  is  clear  furthermore  that 

k+1 


(5- 3.7) 


Consequently 


lim 


o*» 


D(k+l)S- 
DUOS' 


=  s 


(5.3.8) 


lim  E  =  lim  G  =  0 
k—>    °°    k— *o° 


The  rapidity  of  convergence  of  this  limit  is  of  course  dependent  upon  £  and  r-> 
For  8  <-   0.5,  D(k^S   is  less  than  0,001  for  k>20  and  r*.J.     For  most  time  series 
to  be  analyzed^  it  should  be  safe  to  assume  that  both  E  and  G  are  arbitrarily 
close  to  zero.  We  thus  obtain 

(5.3.9) 


c2  JK  +   b2  (M+N) 

7FR  ~   2  .  2   .  2  n  > 

c  k  +  b  Q 


an  expression  which  is  dependent  only  on  c,  b,  k, ^ ,  F,  and  R.  If,  moreover, 
c  =  0  then  7„_,  =  (M+N)/Q  and  is  thus  independent  of  the  parameter  of  the  first- 
stage  process  generating  the  random  shocks. 
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6.  A  note  on  applications  to  objective  data 

In  this  section  I  shall  present  an  illustrative  application  of  the 
analysis  of  sections  3  through  5°  In  order  to  apply  this  analysis  to  objective 
data,  it  is  of  course,  necessary  to  say  something  about  the  likelihood  of  obtain- 
ing estimates  of  y       from  a  realization  of  a  particular  process  which  are  within 
some  stated  range  around  the  theoretical  values  of  7   for  that  process.  I  shall 
therefore  briefly  "say  something"  about  this  problem  in  section  6.1  before  turn- 
ing to  the  application  in  section  6.2 

6.1  The  evaluation  of  realisation  estimates 

Since  I  have  defined  "V—  as  the  expected  value  of  the  estimate  y 
defined  in  equation  (2.3),  and  since  moreover  y       asymptotically  converges  to 

rn 

its  expected  value,  it  is  clear  that  estimates  of  y       obtained  from  realizations 
of  a  given  process  discussed  in  section  3»  b,    or  5  will  converge  asymptotically 
to  the  values  derived  for  that  process.  The  likelihood  of  being  parent  to  a  given 
realization  is  thus  higher  for  one  process  than  for  another  if  the  form  of 
y   (F>,R)  defined  by  the  first  process  more  closely  accords  with  the  estimates  of 
y       computed  from  the  realization  that  does  that  defined  by  the  second.  However, 
the  difference  between  the  two  likelihoods  may  be  small,  even  if  y   (F, R)  is  quite 
different  for  the  two  processes  compared,  if  the  sampling  distribution  of  esti- 
mates for  y       is  wide  for  realizations  of  the  two  processes  containing  the  same 
number  of  observations  as  the  given  realizations.  The  fact  of  asymptotic  con- 
vergence consequently  does  not  of  itself  have  much  practical  value,  particularly 
since  the  size  of  realizations  which  can  be  obtained  from  relevant  historical 
records  is  usually  small.  It  is  the  small-sample  distributions  of  estimates 
of  y       with  which  we  must  be  concerned.  Small-sample  distributions  of  estimates 
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of  7  obtained  by  repeated  simulation  will  be  presented  in  [ll]  for  various 
FK 

processes  analyzed  in  sections  3,   **-.»  and  5° 

6.2  An  illustrative  application 

The  application  to  be  discussed  in  this  section  involves  the  analysis 

of  estimates  of  t„„  obtained  from  a  sample  of  time  series  data  rather  than  from 
'FR 

subjective  information.  The  sample  data  consist  of  a  set  of  monthly  observations 

of  total  domestic  shipments  of  carbon  black  for  the  Ik   years  19^7  through  1960„ 

Since  these  data  are  discussed  in  [6],  I  shall  not  further  comment  here,  but 

shall  instead  immediately  turn  to  an  analysis  of  the  estimates  presented  in  [8]  of 

7__  for  cases  where  F  =  R. 
'FR 

The  estimates  of  7   obtained  from  the  carbon  black  data  are  presented 

in  Chart  7  for  values  of  R  ranging  from  1  to  60-  (F  is  set  equal  to  R.)  The 

least-squares  estimates  of  the  standard  errors  of  estimate  of  each  7   are  also 

UK 

presented  in  Chart  7.<-  plotted  symetrlcally  around  null  values  of  7  . 

It  is  evident  that  a  systematic  pattern  exists  in  the  estimates  of  7 
presented  in  Chart  Jf   the  most  obvious  characteristic  of  which  is  the  positive 
sign  of  7  R  for  R>i+3»  Such  a  pattern  corresponds  with  what  a_  priori  information 
would  suggest;  namely,  that  "rational"  forecasts  of  AF  should  imply  negative 
values  of  7   in  the  short-term  for  most  industrial  series,  reflecting  the  rela- 
tlvely  large  short-term  variability  of  such  data,  but  should  imply  positive  values 
of  7       over  periods  long  enough  for  secular  trends  to  be  manifested «  Both  a  priori 
information  and  the  pattern  exhibited  in  Chart  7  make  highly  unlikely  the  possibil- 
ity that  the  carbon  black  shipments  are  a  realization  of  the  purely  random  process 
defined  by  (3° 1.2)  with  \  =  0  or  by  (5d.ll)  with  S  =  0„  Likewise  unlikely  is 
the  possibility  that  carbon  black  shipments  are  a  random  walk,  even  around  some 

trend  since  the  asymptotic  values  of  7   would  in  that  event  be  negative  for  no 

ill 


-59- 


CHART  7 

RELATIONSHIP  BETWEEN  SUCCESSIVE  CHANGES 
IN  MONTHLY  SHIPMENTS  OF  CARBON  BLACK 


SOURCE;  Table  2-2  in  [8] 
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values  of  R. 

If  it  could  be  assumed  that  random  shocks  affecting  carbon  black  ship- 
ments are  identically  and  independently  distributed,  then  it  would  be  possible 
to  restrict  analysis  to  the  model  defined  by  (4.1.3)  because  of  the  non-positive- 
ness  of  7  (F,R)  for  all  F  and  R  for  the  model  defined  by  (5.2.4).  Actually,  it 
is  quite  likely  that  there  has  been  a  drift  in  the  expected  value  of  these  random 

shocks,  making  it  useful  to  compare  the  extent  to  which  the  estimates  of  7_  for 

FR 

carbon  black  shipments  accord  with  more  complicated  models.  However,  since  the 
method  of  analysis  is  the  same,  I  shall  restrict  myself  here  to  comparing  the 
estimates  presented  in  Chart  7  with  the  7(FsR)  defined  by  various  values  of  the 
parameters  of  (4.1.3). 

Since  the  estimates  of  7   are  all  positive  and  more  than  one  standard 
error  unit  away  from  zero  for  R^45  and  all  negative  and  again  more  than  one 
standard  error  unit  away  from  zero  for  R^3^  the  coefficients  of  variation  of 
the  random  shocks  which  must  be  associated  with  given  values  of  X,  in  order  that 
7FR  be  zero  for  values  of  R  within  that  range  can  be  calculated  from  (4.1.10) 
again  setting  F  =  Re  The  values  of  b/a  thus  obtained  can  then  be  substituted 

into  (4.1.9)  along  with  the  given  values  of  X   to  generate  values  of  j__  to  com- 

FR 

pare  with  every  estimate  of  7   obtained  from  the  realization  data.  Chart  8 

r  K 

presents  the  results  of  a  set  of  such  comparisons  for  generated  values  of  7 

FR 

using  the  midpoint  and  extreme  values  of  the  range  of  R  for  which  the  carbon 

black  7   estimates  are  close  to  zero.  The  standard  of  comparison  is  the  overall 

mean  squared  difference  between  the  carbon  black  7__  estimates  and  the  generated 

FR 

values  of  7FR.  Some  of  the  sets  of  7   values  generated  for  selected  values  of  X, 
are  presented  in  Chart  9.  As  Charts  8  and  9  indicate,  the  "best  fit"  values  of  X, 
would  seem  to  be  equal,  to  or  below  0.01,  which  would  indicate  a  greater  degree 
of  randomness  in  the  shipments'  series  than  perhaps  might  be  expected  a  priori. 
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CHART  8 

MEAN  SQUARED  DIFFERENCE  BETWEEN  THE 
ESTIMATES  OF  7^  PRESENTED  IN  CHA^  7 

AND  VALUES  OF  7^  COMPUTED  FROM  (U.1.9) 

FOR  F=R=35,J+0  and  k$  AND  FOR  SELECTED 
VALUES  OF  X 


-£ez    A 


.10  - 


.05 


«? 


.4 


.S" 


,6 


.7 


,8 


,9 


1,0 


-62- 

CHART  9 

COMPARISON  OF  y       COMPUTED  FROM  (^.1.9) 

FOR  F=R=40  AND  SELECTED  VALUES  OF  \  WITH 

THE  ESTIMATES  OF  7__  FOR  CARBON  BLACK 
rn 

PRESENTED  IN  CHART  7 
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However,  it  should  be  recalled  that  in  this  model  FA,  =  aXF.  Fitting  a  constrained 
regression  on  F  to  the  mean  values  of  A„  for  the  carbon  black  data  yields  an  esti- 
mate a\  =  260,  so  that  for  \  =  0.01,  for  example,  we  would  obtain,  substituting 
a,  X,  and  V  in  (k.1.3), 

(6.2.1)  X  =  26000(1+. Olt)  +  10U   ^-   £  , 

the  variable  t  being  undefined- 

Other  models  would  be  compared  with  this  one  by  using  the  same  method 
of  analysis  and  discriminating  among  them  by  mean  squared-difference  criteria. 
Further  analysis  of  the  differences  could  yield  additional  information  that 
might  be  useful  in  specifying  the  time  series  structure  of  the  observed  data. 
For  the  model  fitted  in  the  above  paragraph,  for  instance,  analysis  of  the  mean 
squared  differences  between  observed  and  generated  7__.  over  smaller  ranges  of  R 
indicates  that  for  1  ~  R  ^10  this  mean  squared  difference  is  lowest  for  the 
model  specified  by  X  =   .30,  thus  indicating  a  substantially  lesser  degree  of  random- 
ness over  the  short  run  and  in  turn  suggesting  presence  of  some  first-order  auto- 
correlation in  the  random  shocks  \  .  Analysis  ?f  such  models  may  also  be  extended 
by  using  estimated  equations  such  as  (6.2.1)  as  a  means  of  estimating  the  random 
shocks  r"  ,  and  then  analyzing  the  interdependence  between  the  £"  „  and  other 
factors.  But  this  sort  of  analysis  begins  to  take  us  beyond  the  scope  of  this 
paper . 

7.   Conclusion 

In  a  recent  monograph  [23,  p.  59l.»  Frank  Fisher  has  included  an  apt 
quotation  from  Heraclitus;   "The  Lord  whose  oracle  is  at  Delphi  neither  reveals 
nor  conceals,  but  he  indicates  his  meaning  through  hints . "  Faced  with  the 


-6k- 

problem  of  Identifying  the  time-structure  of  economic  interdependences,  the 
econometrician  cannot  but  be  impressed  by  the  difficulties  inherent  in  inter- 
preting the  hints  contained  in  observed  data.  While  the  analysis  described 
in  this  paper  has  attempted  to  unravel  some  of  the  simpler  aspects  of  this 
specification  problem,  it  has  raised  many  more  unanswered  questions.  But  this 
perhaps  is  to  be  expected  in  the  Delphic  art  of  econometrics. 
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